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Molecular machines consist of either a single protein or a macromolecular complex composed of 
protein and RNA molecules. Just like their macroscopic counterparts, each of these nano-machines 
has an engine that "transduces" input energy into an output form which is then utilized by its cou- 

^-^ . pling to a transmission system for appropriate operations. The theory of heat engines, pioneered by 

»vj ' Carnot, rests on the second law of equilibrium thermodynamics. However, the engines of molecular 

machines, operate under isothermal conditions far from thermodynamic equilibrium. Moreover, one 
of the possible mechanisms of energy transduction, popularized by Feynman and called Brownian 
ratchet, does not even have any macroscopic counterpart. But, molecular machine is not synony- 
mous with Brownian ratchet; a large number of molecular machines actually execute a noisy power 
stroke, rather than operating as Brownian ratchet. The man-machine analogy, a topic of intense 
philosophical debate in which many leading philosophers like Aristotle and Descartes participated, 
was extended to similar analogies at the cellular and subcellular levels after the invention of opti- 
cal microscope. The idea of molecular machine, pioneered by Marcelo Malpighi, has been pursued 

,S^ • vigorously in the last fifty years. It has become a well established topic of current interdisciplinary 

^1^' research as evident from the publication of a very infiuential paper by Alberts towards the end of 

the twentieth century. Here we give a non-technical overview of the strategies for (a) stochastic 
modeling of mechano-chemical kinetic processes, and (b) model selection based on statistical in- 
ference drawn from analysis of experimental data. It is written for non-experts and from a broad 
jyj ' perspective, showing overlapping concepts from several different branches of physics and from other 

^ , areas of science and technology. 
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I. INTRODUCTION 



Cell is the structural and functional units of life. How 
"active" is the interior of a living cell? Imagine an un- 
der water "metro city" which is, however, only about 
10/J,m long in each direction! In this city, there are "high- 
ways" and "railroad" tracks on which motorized "ve- 
hicles" transport cargo to various destinations. It has 
an elaborate mechanism of preserving the integrity of 
the chemically encoded blueprint of the construction and 
maintenance of the city. The "factories" not only sup- 
ply their products for the construction and repair works, 
but also manufacture the components of the machines. 
This eco-friendly city re-charges spent "chemical fuel" in 
uniquely designed "power plants". This city also uses 
a few "alternative energy" sources in some operations. 
Finally, it has special "waste-disposal plants" which de- 



grade waste into products that are recycled as raw mate- 
rials for fresh synthesis. All the automated processes in 
this high-tech micro-city are run by nano-machines [l|. 
This is not the plot of a science fiction, but a dramatized 
picture of the dynamic interior of a cell. 

A molecular machine is either a single protein or a 
macromolecular complex consisting of proteins and RNA 
molecules [2]. Just like their macroscopic counterparts, 
these nano-machines take an input (most often, chemical 
energy) and it transduces input energy into an output. If 
the output is mechanical work the machine is usually re- 
ferred to as a molecular motor [3-[a] . Similarly, a cell can 
also be regarded as a micron-size "energy transforming 
device" . The concept of "machine" is not restricted only 
to subcellular or cellular levels of biological organization. 
In ancient times, philosophers proposed the concept of 
"living machine" to describe a whole animal, including a 



man; we'll discuss the evolution of this concept towards 
the end of this article. 

Not only animals, but even plants also move in re- 
sponse to external stimuli [6|, |7| and movements take 
place in plants at all levels- from whole plant and plant 
cell to the subcellular level. I cannot resist the tempta- 
tion of quoting Thomas Huxley's poetic description of cy- 
toplasmic streaming [8] , in particular, and of intracellular 
motor traffic, in general, in plants [9|: "..the wonderful 
noonday silence of a tropical forest is, after all, due only 
to the dulness of our hearing; and could our ears catch 
the murmur of these tiny maelstroms, as they whirl in the 
innumerable myriads of living cells which constitute each 
tree, we should be stunned, as with the roar of a great 
citf . 

In this article, however, we focus attention almost ex- 
clusively on molecular machines that drive subcellular 
processes within living cells. The processes driven by 
molecular machines include not only intracellular mo- 
tor transport, but also manipulation, polymerization and 
degradation of the bio- molecules [Ifl, [ill . Explaining the 
physical principles that govern these machine-driven pro- 
cesses will bring us closer to the ultimate goal of biologi- 
cal physics: understanding "what is life" in terms of the 
laws of physics (and chemistry). 

Biomolecular machines operate in a domain where the 
appropriate units of length, time, force and energy are, 
nano-meter, milli-second, pico-Newton and fcsT, respec- 
tively (ks being the Boltzmann constant and T is the ab- 
solute temperature). Aren't the operational mechanism 
of molecular machines similar to their macroscopic coun- 
terparts except, perhaps, the difference of scale? NO. In 
spite of the striking similarities, it is the differences be- 
tween molecular machines and their macroscopic coun- 
terparts that makes the studies of these systems so inter- 
esting from the perspective of physicists. 

This article is a brief guide for a beginner embark- 
ing on an exploration of the exciting frontier of research 
on molecular machines. It begins with a catalog of 
the known molecular machines and a list of fundamen- 
tal questions on their operational mechanism that the 
explorer should try to address. Like any travel guide, 
this article also cautions the explorer about the counter- 
intuitive phenomena that await him/her in this new ter- 
ritory of his/her exploration where he/she may need new 
sophisticated technical tools that were not needed for 
handling macroscopic machines. For the explorer, this 
article also charts a tentative road map along with a sum- 
mary of the mathematical strategies that he/she might 
use to make progress in this frontier territory. Excursions 
to some of the listed border areas that overlap with areas 
of research in other branches of science and engineering 
could be enjoyable and intellectually rewarding for the 
explorer. For any explorer, it is good to know the experi- 
ence of the pioneers. The final section, on the evolution 
of the concept of biological machines, is likely to be en- 
joyed as a dessert by physicists, and as a fresh food for 
thought by historians and philosophers of science. 



II. A CATALOGUE: TYPICAL EXAMPLES OF 
MOLECULAR MACHINES AND FUELS 

Based on the mode of operation, the biomolecular ma- 
chines can be divided broadly into two groups. Cyclic 
machines operate in repetitive cycles in a manner that 
is very similar to that of the cyclic engines which run 
our cars. In contrast, some other molecular machines are 
one-shot machines that exhaust an internal source of free 
energy in a single round. Force exerted by a compressed 
spring, upon its release, is a typical example of a one-shot 
machine. 



A. Fuels for molecular machines 

The most common way of supplying energy to a nat- 
ural nanomachine is to utilize the chemical energy (or, 
more appropriately, free energy) released by a chemi- 
cal reaction. Most of the machines use the so-called 
high-energy compounds- particularly, nucleoside triphos- 
phates (NTPs)- as an energy source to generate the me- 
chanical energy required for their directed movement. 
The most common chemical reaction is the hydrolysis of 
ATP (ADP): ATP -^ ADP+P,. Some other high-energy 
compounds can also supply input energy; one typical ex- 
ample being the hydrolysis of Guanosine Triphosphate 
(GTP) to Guanosine Diphosphate (GDP). Under normal 
physiological conditions, hydrolysis of ATP is extremely 
slow. Most of the molecular motors fuelled by ATP func- 
tion also as ATPase enzyme (i.e., catalyzes hydrolysis of 
ATP) thereby speeding up the energy-supplying reaction 
by several orders of magnitude. 

If a cyclic machine runs on a specific chemical fuel then 
the spent fuel must be removed as waste products and 
fresh fuel must be supplied to the machine. Fortunately, 
normal cells have machineries for recycling waste prod- 
ucts to manufacture fresh fuel, e.g., synthesizing ATP 
from ADP. This raises an important question: since ATP 
is a higher-energy compound than ADP, how are the 
ATP-synthesizing machines driven to perform this ener- 
getically "uphill" task? Fortunately, chemical fuel is not 
the only means by which input energy can be supplied to 
intracellular molecular machines. 

A cell gets its energy from external sources. It has 
special machines to convert the input energy into some 
"energy currency" . For example, chemical energy sup- 
plied by the food we consume is converted into an electro- 
chemical potential A/i that not only can be used to syn- 
thesize ATP, but can also directly run some other ma- 
chines. In plants similar proton- motive forces are gen- 
erated by machines which are driven by the input sun- 
light. Only hydrogen ion {H~^), i.e., a proton, and sodium 
ion {Na'^) are used in the kingdom of life to create the 
electro-chemical potentials, i.e., it used either a proton- 
motive force (PMF) or a sodium-motive force (SMF) as 
an energy currency. 

The advantages of using light, instead of chemical re- 



Motor superfamily 


Filamentous track 


Minimum step size 


Myosin 


F-actin 


36 nm 


Kinesin 


Microtubule 


8 nm 


Dynein 


Microtubule 


8 nm 



TABLE I: Superfamilies of motor proteins and the corre- 
sponding tracks. 



action, as the input energy for a molecular motor are as 
follows: (i) light can be switched on and off easily and 
rapidly, (ii) usually, no waste product, which would re- 
quire disposal or recycling, is generated. 

Thus, study of molecular machines deals with two com- 
plementary aspects of bioenergetics: (a) conversion of 
energy input from the external sources into the energy 
currency of the cell, and (b) conversion of the energy 
currency to drive various active other active processes. 



the motor itself! It is desirable that such a motor "walks" 
for a significant distance on its track carrying the cargo; 
for obvious reasons, such motors are referred to as porters 
[13| . Kinesin and dyneins attached simultaneously to the 
same "hard" cargo can get engaged in a tug-of-war lead- 
ing to a bidirectional movement of the cargo [ij, Il5| . 
In regions of overlap between MT and F-actin filaments, 
a large cargo may be hauled simultaneously by kinesins 
and myosins which, however, walk on their respective 
tracks. Alternatively, along its journey route, a cargo 
may be transferred from the MT-based transport net- 
work, which dominate at the cell center, to the F-actin 
based network that covers most of the cell periphery [l6| . 
A "soft" car go p ulled by many kinesins can get elongated 
into a tube llTJI. 



B. Cytoskeletal motors 

The cytoskeleton of a cell is the analogue of the hu- 
man skeleton [3|. However, it not only provides mechan- 
ical strength to the cell, but its filamentous proteins also 
form the networks of "highways" (or, "tracks") on which 
cytoskeletal motor proteins [3, Hi can move. Filamen- 
tous actin (F-actin) and microtubules (MT) which serve 
as tracks are "polar" in the sense that the structure and 
kinetics of the two ends of each filament are dissimilar. 

The superfamilies of cytoskeletal motors and the cor- 
responding filamentous tracks are listed in table ID Every 
superfamily can be further divided into families. Mem- 
bers of every family move always in a particular direc- 
tion on its track; for example, kinesin- 1 and cytoplasmic 
dynein move towards -I- and - end of MT, respectively. 
Similarly, myosin-V and myosin- VI move towards the -|- 
and - ends of F-actin, respectively. 

For their operation, each motor must have a track- 
binding site and another site that binds and hydrolyzes 
ATP (so-called ATPase site). Both these sites are lo- 
cated, for example, in the head domain of myosins and 
kinesins both of which walk on their heads! The motor- 
binding sites on the tracks are equispaced; the actual step 
size of a motor can be, in principle, an integral multiple 
of the minimum step size which is the separation between 
two neighboring motor-binding sites on the correspond- 
ing track. 



1. Porters 

Some linear motors are cargo transporters 'l2f|; how- 
ever, the size of the cargoes are usually much larger than 



2. Machines for chipping filamentous tracks 



A MT depolymerase is a kinesin motor that chips away 
its own track from one end [1^ . Members of the kinesin- 
13 family can reach either end of the MT diffusively 
(without ATP hydrolysis) and, then, start chipping the 
track from the end where it reaches. In contrast, mem- 
bers of the kinesin-8 family walk towards the plus end of 
the MT track hydrolyzing ATP and after reaching that 
end starts chipping it from there [19l |20| . Chipping by 
both families of depolymerase kinesins are energized by 
ATP hydrolysis. 



Contractility: motor-filament crossbridge and collective 
dynamics of sliders and rowers 



Some motors are capable of sliding two different fila- 
ments with respect to each other by stepping simulate- 



nously on these two filaments 21|. Some sliders work in 



groups and each detaches from the filament after every 
single stroke; these are often referred to as rowers because 
of the analogy with rowing with oars jlSl. Contractility ^ 
rather than motility, at the subcellular and cellular level 
are driven by the sliders and rowers |22l . l23j . Some exam- 
ples of this category are listed in the table HH the details 
can be found in the cited review articles. 



J 



Motor 


Sliding filaments 


Function (example) 


Myosin 


"Thin filaments" (F-actin) of muscle fibers 


Muscle contraction [24] 


Myosin 


"Stress fibers" (F-actin) of non-muscle cells 


Cell contraction [25] 


Myosin 


Cytokinetic "contractile ring" (F-actin) in eukaryotes 


Cell division [26] 


Kinesin 


Interpolar microtubules in mitotic spindle 


Mitosis [27, 28 


Dynein 


Microtubules of axoneme 


Beating of eukaryotic flagella [29, 30] 


Dynein 


Microtubules of megakaryocytes 


Blood platelet formation [31] 



TABLE II: Few example of cytoskeletal rowers and sliders as 
well as their biological functions. 



Polymer 


mode of force generation 


Function (example) 


MT 


polymerizing pistion-like 


organizing cell interior [35] 


F-actin 


polymerization 


cell motility [36] 


FtsZ 


polymerization 


bacterial cytokinesis [37 


MSP 


polymerization 


motility of nematode sperm cells [38] 


Type-IV pili 


polymerization 


bacterial motility [39] 


MT 


de-polymerization 


Eukaryotic chromosome segregation [33] 


spasmin 


spring-like 


vorticellid spasmoneme [34] 


Coiled actin 


spring-like 


egg fertilization by sperm cell of the horse-shoe crab Limulus polyphemus [40] 



TABLE III: Force generation by polymeriz- 
ing/depolymerizing, coiling/ uncoiling filaments: pistons, 
hooks and springs. 



4-. Push and pull of cytoskeletal filaments: nano-pistons 
and nano-springs 

Elongation of filamentous biopolymers that presses 
against a light object (e.g., a membrane) can result in 
a "push" |32| . Similarly, a depolymerizing tubular fila- 
ment can "pull" a light ring-like object by inserting its 
hook-like outwardly curled depolymerizing tip into the 
ring [33|. A flexible filament, upon compression by in- 
put energy, can store energy that can perform mechan- 
ical work when the filament springs back to its original 
relaxed shape [34]. Some typical examples are given in 

tabicnni 



Machines for synthesis, manipulation and 
degradation of macromolecules of life 



Membrane 


Polymer 


Nuclear envelope 


RNA/Protein [43, 47] 


Membrane of endoplasmic reticulum 


Protein [48 


Membranes of mitochondria/chloroplasts 


Protein [49, 50] 


Membrane of peroxisome 


Protein [51] 



TABLE IV: Membrane-bound translocases. 



from three sources: (a) bending of stiff DNA molecule 
inside the capsid; (b) strong electrostatic repulsion be- 
tween the negatively charged strands of the DNA; (c) 
loss of entropy caused by the packaging [4a] . The pack- 
aging motor has to be powerful enough to overcome such 
a high pressure. 



1. Membrane-associated machines for macromolecule 
translocation: exporters, importers and packers 

In many situations, the motor remains immobile and 
pulls a macromolecule; the latter are often called translo- 
case. Some translocases export (or, import) either a pro- 
tein [41i| or a nucleic acid strand [42,,^] across the plasma 
membrane of the cell or, in case of eukaryotes, across in- 
ternal membranes. A list is provided in table lIVl 

The genome of many viruses are packaged into a pre- 
fabricated empty container, called viral capsid, by a pow- 
erful motor attached to the entrance of the capsid. As 
the capsid gets filled. The pressure inside the capsid in- 
creases which opposes further filling [4J: Eal ■ The effec- 
tive force, which opposes packaging, gets contributions 



2. Machines for degrading macromolecules of life 



Restriction-modification (RM) enzyme defend bacte- 
rial hosts against bacteriophage infection by cleaving 
the phage genome while the DNA of the host bacteria 
are not cleaved. Exosome and proteasome are machines 
that shred RNA and proteins into their basic subunits, 
namely, nucleotides and amino acids, respectively. Sim- 
ilarly, there are machines for degrading polysachharides, 
e.g., cellulosome (a cellulose degrading machine), starch 
degrading enzymes, chitin degrading enzyme (chitinase), 
etc. These machines are listed in table FVl 



Polymer 


Examples of Machines 


DNA (polynucleotide) 


RM enzyme [52] 


RNA (polynucleotide) 


Exosome [53] 


Protein (polypeptide) 


Proteasome [54] 


Cellulose (polysachharide) 


Cellulosome [56] 


Starch (polysachharide) 


Starch degrad. enzyme [55] 


Chitin (polysachharide) 


Chitinase [57] 



TABLE V: Machines for degradation of macromolecules of 
life. 



a RNA secondary structure. During DNA replication, 
a helicase moves ahead of the polymerase, like a mine 
sweeper, unzipping the duplex DNA and dislodging other 
DNA-bound proteins. However, the transcriptional and 
translational machineries do not need assistance of any 
helicase because these are capable of unzipping DNA and 
unwinding RNA, respectively, on their own. A helicase 
can be monomeric, or dimeric or hexameric. 



Machine 


Template 


Product 


Function 


DdDP 


DNA 


DNA 


DNA replication [58] 


DdRP 


DNA 


RNA 


Transcription [59-61] 


RdDP 


RNA 


DNA 


Reverse transcription [62] 


RdRP 


RNA 


RNA 


RNA replication [63] 


Ribosome 


mRNA 


Protein 


Translation [64-66] 



TABLE VL Types of polymerizing machines, the templates 
they use and the corresponding product of polymerization. 



3. Machines for template- dictated polymerization 

Two classes of biopolymers, namely, polynucleotides 
and polypeptides perform wide range of important func- 
tions in a living cell. DNA and RNA are examples of 
polynucleotides while proteins are polypeptides. Both 
polynucleotides and polypeptides are made from a lim- 
ited number of different species of monomeric building 
blocks, namely, nucleotides and amino acids, respectively. 
The sequence of the monomeric subunits to be used for 
synthesis of each of these are dictated by that of the 
corresponding template. These polymers are elongated, 
step-by-step, during their birth by successive addition of 
monomers, one at a time. The template itself also serves 
as the track for the polymerizer machine that takes chem- 
ical energy as input to polymerize the biopolymer as well 
as for its own forward movement. Therefore, these ma- 
chines are also referred to as motors. 

Depending on the nature of the template and prod- 
uct nucleic acid strands, polymerases can be classified 
as DNA-dependent DNA polymerase (DdDP), DNA- 
dependent RNA polymerase (DdRP), etc. as listed in 
the table rvn 



4- Unwrappers and unzippers of packaged DNA: chromatin 
remodellers and Helicases 



In an eukaryotic cell DNA is packaged in a hierarchi- 
cal structure called chromatin. In order to use a sin- 
gle strand of the DNA as a template for transcription 
or replication, it has to be unpackaged either locally or 
globally. ATP-dependent chromatin remodelers [67| are 
motors that perform this unpackaging. However, only 
one of the strands of the unpackaged duplex DNA serves 
as a templa.te; the duplex DNA is unzipped by a DNA 
helicase [63, |60|. Similarly, a RNA helicase unwinds 



D. Rotary motors 

Rotary molecular motors are, at least superficially, 
very similar to the motor of a hair dryer. Two rotary 
motors have been studied most extensively, (i) A rotary 
motor embedded in the membrane of bacteria drive the 
bacterial flagella [70, l71| which, the bacteria use for their 
swimming in a queo us media, (ii) A rotary motor, called 
ATP synthase [T^, [TJ], is embedded on the membrane 
of mitochondria, the powerhouses of a cell. A synthase 
drives a chemical reaction, typically the synthesis of some 
product; the ATP synthase produces ATP, the "energy 
currency" of the cell, from ADP. 



E. Processivity, duty ratio, stall force and dwell 
time 

One can define processivity of a motor in three different 
ways: 

(i) Average number of chemical cycles in between attach- 
ment and the next detachment from the track; 
(ii) m,ean time of a single run, i.e., in between an attach- 
ment and the next detachment of the motor from the 
track; 

(iii) mean distance spanned by the motor on the track in 
a single run. 

Since the definitions (ii) and (iii) are directly accessible 
to experiments, these are more useful than the definition 
(i). Another related, but distinct, concept is that of duty 
ratio which is defined as the average fraction of the time 
that each head spends remaining attached to its track 
during one cycle. 

To translocate processively, a motor may utilize one of 
the three following strategies: 

Strategy I: the motor may have more than one track- 
binding domain (oligomeric structure can give rise to 
such a possibility quite naturally) . Most of the cytoskele- 
tal motors like conventional two-headed kinesin use such 
a strategy 74]. One of the track-binding sites remains 
bound to the track while the other searches for its next 
binding site. 

Strategy II: A motor may possess non-motor extra do- 
mains or some accessory protein(s) bound to it which 
can interact with the track even when none of the motor 
domains of the motor is attached to the track. Dynein 
seems to exploit dynactin [75| to enhance its own proces- 
sivity. 



Strategy III: it can use a "clamp-like" device to remain 
attached to the track; opening of the clamp will be re- 
quired before the motor detaches from the track. For 
example, DdDP utilizes this strategy |7a |. 

An external force directed opposite to the natural di- 
rection of walk of a motor is called a load force. Average 
velocity of a motor decreases with increasing load force. 
The magnitude of the load force at which the average 
velocity of the motor vanishes, is called the stall force. 
The force-velocity relation is one of the most fundamen- 
tal characteristic properties of a motor. Its status in bio- 
physics is comparable, for example, to that of the I-V 
characteristics of a device in semiconductor physics. 

Two motors with identical average velocities may ex- 
hibits widely different types of fluctuations. Therefore, 
as we'll show in detail, a deeper understanding of the op- 
erational mechanism of a motor can be gained from the 
distributions of their "dwells" at the successive spatial 
positions on the track. 



F. Fundamental general questions 

(i) Single-molecule mechanism |77| : How do the in- 
teractions among the component structural units of an 
individual motor, motor-track interactions, motor-ligand 
(fuel) interactions and the mechano-chemical kinetics of 
the system determine, for example, (a) the directional- 
ity, (b) processivity, (c) dwell-time distribution, (d) force- 
velocity relation, and (e) efficiency of transduction? Does 
a given dimeric motor walk hand- over- hand or crawl like 
an inchworm, and why? Do the ATPase domains of a 
hexameric motor "fire" (a) in series, or (b) in parallel, 
or, (c) in random sequence? 

(ii) Multi-motor coordination in a "workshop": The 
motors do not work in isolation in- vivo. What are the 
mechanisms and consequences of the spatio-temporal co- 
ordination of the motors? For example, a single mitotic 
spindle consists of many polymerizing and depolymeriz- 
ing MTs, several types of cytoskeletal motors, including 
depolymerases [23|. A replisome, the workshop for DNA 
replication, has to coordinate the operations of clamps 
and clamp loaders with those of primases, polymerases, 
etc. [58|. Similarly, a ribosome is a mobile platform on 
which the operations of several devices have to be coordi- 
nated properly during protein synthesis [6J| . Well known 
co-directional as well as head-on collisions of polymerases 
and those of ribosomes can create traffic jam under some 
circumstances whereas a collision can restart stalled traf- 
fic in other circumstances (7811. 



III. MOTORING IN A VISCOUS FLUID: FROM 
NEWTON TO LANGEVIN 

Force is one of the most fundamental quantities in 
physics. As we'll argue in this section, some of the forces 



which dominate the dynamics of molecular machines have 
negligible effect on macroscopic machines. 



A. Newton's equation: deterministic dynamics 

For simplicity, let us first consider a hypothetical sce- 
nario where neither the tracks nor the fuel molecules are 
present in the aqueous medium in which there is a free 
(i.e., not bound to any other molecule) motor protein. 
Suppose the medium consists of only N "particle-like" 
molecules and the center of mass of the motor protein 
is also represented by a "particle" . Then, the exact tra- 
jectories of all these A^ 4- 1 "particles" can be obtained 
by solving the corresponding coupled Newton's equations 
that describe the dynamics of the A^ -|- 1 particles. In 
reality, this approach is impractical because, even with 
the largest and fastest computers available at present, 
we cannot get the trajectories over time intervals of the 
order of Is which is relevant for majority of the motor 
proteins as long as A^ is large (A^ ^ 10^'^). 



B. Langevin equation: stochastic dynamics 

A more pragmatic approach would be to solve only the 
equations of motion for the motor protein by treating the 
N particles of the medium as merely the constituents of 
a reservoir. In other words, one monitors the motion in a 
6-dimensional subspace of the full 6(A^ -I- l)-dimensional 
phase space of the system. The price one has to pay for 
this simplification is that instead of the original Newton's 
equation for the motor protein, one has to now solve a 
Langevin equation that describes the "Brownian" motion 
of the motor protein. Since we are soon going to extend 
the discussion in the presence of filamentous tracks which 
are essentially one-dimensional, we write the Langevin 
equation for the "Brownian" motion of the motor protein 
in one-dimension m{d'^ x / dt'^) = Fd -\- ^ where m is the 
mass of the protein, Fd = —jv is the viscous drag and and 
^ is the random Brownian force. Note that the Langevin 
equation is neither deterministic nor symmetric under 
time-reversal {t — ?> —t). 



1. Motoring in a "sticky" medium: Purcell's idea 

Using the Stokes formula 7 — Q'Kr^r for a globular pro- 
tein of radius r ^ lOnm moving with an average ve- 
locity V ^ Im/ sec (corresponding to Inm/ns) in the 
aqueous medium of viscosity rj ^ 10^^ Pas we get an 
estimate Fd ^ 200pN. Surprisingly, this viscous drag 
force is about 200 times larger than the elastic force it 
experiences when stretched by Inm ! Consequently, the 
Reynold 's number, which is the ratio of the inertial and 
viscous forces, is at most of the order 0(10^^). The 
Reynold's number will be of the order of 10~^ if you ever 
try (not recommended) to swim in honey! 
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Now let us supply the tracks and fuel molecules to the 
motor protein in the same medium. The Langevin equa- 
tion will now include additional terms -Fgxt which rep- 
resents the net force arising from the interaction of the 
motor protein with its track and the ligand; the ligand 
could be the fuel molecule or the molecules produced by 
its "burning" (e.g., ATP or ADP). Besides, since the dy- 
namics of motor proteins is expected to be dominated by 
hydrodynamics at low Reynold's number [79[, one gener- 
ally drops the inertial term. In this "overdamped" regime 

lv = F,,t+^ (1) 



2. Motoring under random impacts from surroun, 
Brownian force 



The energy released by the hydrolysis of a single ATP 
molecule is about 10~^^J. Interestingly, the mean ther- 
mal energy ksT associated with a molecule at a temper- 
ature of the order of T ~ lOOK, is also ksT - IQ-^i J. 
Moreover, equating this thermal energy with the work 
done by the thermal force Ft in causing a displacement 
of Inm we get Ft ^ IpN. This is comparable to the elas- 
tic force experienced by a typical motor protein when 
stretched by Inm. Thus, a motor protein that gets bom- 
barded from all sides by random thermal forces is similar 
to a tiny creature is a very strong hurricane! Therefore, 
in contrast to the deterministic dynamics of the macro- 
scopic machines, the dynamics of molecular motors is 
stochastic (i.e., probabilistic). 

Already in the first half of the twentieth century 
D'Arcy Thompson, father of modern bio-mechanics, real- 
ized the importance of viscous drag and Brownian forces 
at the cellular (and subcellular) level. He pointed out 
that |80j | that in this microscopic world ^^gravitation is 
forgotten, and the viscosity of the liguid,..., the molecu- 
lar shocks of the Brownian movement, .... the electric 
charges of the ionized medium, make up the physical en- 
vironment ... predominant factors are no longer those of 
our scale" . 



IV. ENERGY TRANSDUCTION BY 
MOLECULAR MACHINES: FROM CARNOT TO 

FEYNMAN 

Molecular motors generate force by transducing en- 
ergy. Interestingly, one of the two mechanisms of en- 
ergy transduction that I describe here, does not have 
any analogous macroscopic counterpart. I explain in con- 
siderable detail why thermodynamic formalisms, which 
have been successfully utilized to calculate the common 
performance characteristics of macroscopic machines, are 
inadequate for natural nano- machines. 

Molecular motors are made of soft matter whereas 
macroscopic motors are normally made of hard matter 
to withstand wear and tear. Nature seems to exploit the 



high deformability of molecular motors for its biological 
function. But, this difference is minor compared to the 
others and will not be elaborated further here. 



A. Molecular machines are run by isothermal 
engines 

This is in sharp contrast to the macroscopic thermal 
engines which require at least two thermal reservoirs at 
different temperatures and which convert part of the in- 
put heat energy into mechanical work. 

Why can't a molecular machine work as a heat en- 
gine? In order to examine the viability of a intracellular 
heat engine, let us create a temperature gradient on a 
length scale i ~ lOnm. An elementary analysis of heat 
diffusion equation is adequate to show that a tempera- 
ture gradient on a length scale i relaxes within a time 
interval Ttemp ^ c^i^ j k where Ch is the specific heat per 
unit volume and k is the thermal conductivity. Using 
the typical characteristic values of Ch and n, for water, 
one finds 8l| Ttem-p — 10~^ — 10~^s. Thus, temperature 
gradients cannot be maintained for the entire duration of 
even one single cycle of a cyclic molecular machine which 
is, typically, few orders of magnitude longer than Ttemp- 
In other words, for all practical purposes, natural nano- 
machines operate isothermally. Because of the condition 
T = constant, the molecular machines operate as free 
energy transducers. 



B. Inadequacy of equilibrium thermodynamics 

A majority of the molecular motors are chcmo- 
mechanical machines for which input and output are 
chemical energy and mechanical work, respectively. Re- 
call that for Carnot's thermo-mechanical machine, the 
thermodynamic efficiency is rjeq-th = 1 — {Tl/Th) where 
Tff and Tl are the temperatures of the two reservoirs 
at high and low temperatures, respectively. Similarly, 
for an isothermal chemo-mechanical engine, the thermal 
reservoirs would be replaced by chemical reservoirs at 
fixed chemical potentials hh and /i^ {pin > Pi-l)- Con- 
sequently, the heat flow of the thermal engine would be 
replaced by particle flow in the chemical engine. Just 
as the difference of heat input and output in the heat 
engine is converted to mechanical work, the difference 
Mff ^ /^i would be converted into output work in the 
chemical engine. Obviously, the corresponding thermo- 
dynamic efficiency would be [83] Veq-ch = 1 — (Mi/z^ff)- 
Note that, because of the quasi-static nature of each step 
in the equilibrium thermodynamic theory of these cyclic 
engines, each cycle takes infinite time. Naturally, the 
power output Vout = for both the thermal and chemi- 
cal Carnot engines. 

Can we apply the theory of such isothermal Carnot en- 
gines to a chemo-mechanical molecular machine by iden- 
tifying, for example, fin and ^l {i^h > Ml) £^s the chem- 



ical potentials of ATP and ADP, respectively? The an- 
swer is: NO. The cycle time of the real molecular ma- 
chines is finite and their power output is non-zero. More- 
over, motors are examples of open systems J8J| which 
continue to run in repetitive cycles as long as energy is 
pumped in. Such a system cannot be in thermodynamics 
equilibrium. 



C. Inadequacy of endo- reversible thermodynamics 

For macroscopic engines with finite cycle time, the 
characteristics of performance are often reliably calcu- 
lated within the theoretical framework of endo-reversible 
thermodynamics 8J| . In this case, the ratio of the rates 
of output and input energies defines the efficiency of 
transduction. The rate of output work is called power 
output of the engine. 

The formalism of endo-reversible thermodyamics for 
heat engines is based on the assumption that the work- 
ing substance of the engine is coupled to the two thermal 
reservoirs by heat conductors of finite thermal conduc- 
tivity (non-zero thermal resistance). Entire dissipation 
is assumed to take place in the irreversible process of 
heat conduction through these thermal resistors whereas 
all the processes in the working material of the engine 
are assumed to be fully reversible (i.e., no entropy is gen- 
erated internally). In this scenario, the corresponding 
thermal conductances in the Carnot engine are infinite. 

Efficiency at maximum power output r](Vmax), rather 
than maximum efficiency itself, is the most appropriate 
quantitative measure of the performance of such engines. 
The upper-bound of rj{Vmax) is given by the Curzon- 
Ahlborn expression ^] T]cA{Vmax) = 1 - {Tl/ThY''^. 
Similarly, within the framework of the endoreversible 
thermodynamics, a phenomenological theory for chem- 
ical engines can also be formulated if the cycle time is 
finite [86| . Can't we use this approach for molecular ma- 
chines which are also chemical engines with finite cycle 
times? 

Recall that endo-reversible thermodynamics is based 
on the assumption that dissipation takes place only in 
the process of transfer of matter between reservoirs and 
the working substance whereas processes that the work- 
ing substance goes through in each cycle are perfectly 
reversible (and, therefore, non-dissipative). This assump- 
tion is valid if the relaxations in the working substance 
of the engine are very rapid compared to the processes 
involving the coupling of the working substance with the 
reservoirs. The validity of this assumption requires clear 
separation of time scales of relaxation in the working 
substance and in the working substance-reservoir cou- 
pling. But, such separation of time scales does not hold 
in molecular machines which consist of macromolecules. 



D. Domain of non-equilibrium statistical mechanics 

In general, molecular motors operate far beyond the 

linear response regime and, therefore^ the formalism 

of non-equilibrium thermodynamics [87| for coupled 
mechano-chemical processes is not applicable. More- 
over, thermodynamic formalisms developed for macro- 
scopically large systems do not account for the sponta- 
neous fluctuations. On the other hand, because of the 
small size of a molecular motor and because of the low 
concentrations of the molecules involved in its operation, 
fluctuations of positions, conformations as well as the 
cycle times are intrinsic features of their kinetics. There- 
fore, one has to use the more sophisticated toolbox of 
stochastic processes and non-equilibrium statistical me- 
chanics for theoretical treatment of molecular machines. 
Interestingly, as we argue below, noise need not be a nui- 
sance for a motor; instead, a motor can move forward by 
gainfully exploiting this noise! 



E. Defining efficiency: from Carnot to Stokes 

The performance of macroscopic motors are charac- 
terized by a combination of its efficiency, power output, 
maximum force or torque that it can generate. Just like 
the performance of their macroscopic counterparts with 
finite cycle time, that of molecular motors 88] have also 
been characterized in terms of efficiency at maximum 
power, rather than maximum efficiency. However, the 
efflciency of molecular motors can be defined in several 
different ways |89| . 

The efficiency of a motor, with finite cycle time, is 
generally deflned by 



^ / out/ ih 



(2) 



where Vin and Vout are the input and output powers, 
respectively. The usual deflnition of thermodynamic effi- 
ciency r]T is based on the assumption that, like its macro- 
scopic counterpart, a molecular motor has an output 
power [90| 



I out — ^ exty ' 



(3) 



where Fe^t is the externally applied opposing (load) force. 
Although this deflnition is unambiguous, it is unsatisfac- 
tory for practical use in characterizing the performance 
of molecular motors. As explained earlier, a molecular 
motor has to work against the omnipresent viscous drag 
in the intracelluar medium even when no other force op- 
poses its movement (i.e., even if F^xt = 0). 

A generalized efficiency rjc is also represented by the 
same expression ([2]) where, instead of (|3]), the output 
power is assumed to be j91|] 



Vout = F,,tV + -iV . 



(4) 



This definition treats the load force and viscous drag on 
equal footing. In contrast, the "Stokes efflciency" rjs for a 
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molecular motor driven by a chemical reaction is defined 

as lii 



Vs 



7^2 



iAG){r) + F,,tV 



(5) 



where (r) is the average rate of the chemical reaction and 
AG is the chemical free energy consumed in each reaction 
cycle. This efficiency is named after Stokes because the 
viscous drag is calculated from Stokes law. 

As we show in the next subsection, the directional 
movement of some motors arises from the rectification of 
random thermal noise. For such motors, an alternative 
measure of performance is the rectification efficiency [93J . 
Thus, different definitions of the efficiency of a molecular 
motor may arise from different interpretations of their 
tasks or may characterize distinct aspects of their oper- 
ation. Nevertheless, for any definition of efficiency, it is 
essential to ensure that its allowed values lie between 
and 1. 



F. Noisy power stroke and Brownian ratchet 

If the input energy directly causes a conformational 
change of the protein machinery which manifests itself 
as a mechanical stroke of the motor, the operation of the 
motor is said to be driven by a "power stroke" mecha- 
nism. This is also the mechanism used by all man made 
macroscopic machines. However, in case of molecular 
motors, the power stroke is always "noisy" because of 
the Brownian forces acting on it. 

Let us contrast this with the following alternative sce- 
nario: suppose, the machine exhibits both "forward" and 
"backward" movements because of spontaneous thermal 
fluctuations. If now energy input is utilized to prevent 
"backward" movements, but allow the "forward" move- 
ments, the system will exhibit directed, albeit noisy, 
movement in the "forward" direction. Note that the for- 
ward movements in this case are caused directly by the 
spontaneous thermal fluctuations, the input energy rec- 
tifies the "backward" movements. This alternative sce- 
nario is called the Brownian ratchet mechanism [94, 95] . 
The concept of Brownian ratchet [9a] was popularized 
by Richard Feynman with his ratchet-and-pawl device 
[97| . However, in the context of molecular motors, the 
Brownin ratchet mechanism was proposed by several 
groups in the 1990s ^^. 

Thus, in principle, there are two idealized scenarios for 
a transition from a conformation A to a conformation B- 
one is by a pure power stroke and the other by a purely 
Brownian ratchet mechanism [99j. However, for a real 
molecular motor, it is difficult to unambiguously distin- 
guish between a power stroke and a Brownian ratchet 
[lOOJ |: the actual mechanism may be a combination of 
the two idealized extremes. 
•Are Brownian motors Maxwell's demon? 

Brownian ratchet mechanism transduces random ther- 
mal energy of the reservoir into mechanical work. Does 



it imply that these motors are perpetual motors of the 
second kind that violate the second law of thermody- 
namics? In that case, does it have any similarity with 
the Maxwell's demon [101]? A short answer is: No, a 
Brownian ratchet is not a Maxwell's demon because it is 
an open system far from the state of equilibrium whereas 
the second law of thermodynamics is strictly valid only 
for systems in stable thermodynamic equilibrium. 



G. Physical realizations of Brownian ratchet in 
molecular motors 



Is there any real biomolecular motor which can be re- 
garded as a true physical realization of Brownian ratch- 
ets? The answer is an emphatic "yes" and we list a few 
typical examples here. 

KIFIA, a single-headed kinesin, is an example of cy- 
toskeletal motor whose operational mechanism can be in- 
terpreted as a Brownian ratchet [102h104J . The original 
acto-myosin crossbridgc model suggested by Hu xley |l05l | 
can be interpreted as Brownian ratchet [106] although 
Huxley himself did not use this terminology. Brownian 
ratchet can also account for the force generation by a 
polymerizing cytoskeletal filament [107| . 

The crossing of a membrane during the export/import 
of a protein of length L (amino acid monomers) takes 
place at a faster rate by the Brownian ratchet mechanism 
compared to that by pure translational diffusion flOs]. 
It has been claimed that the translocation of a mRNA 
molecule across the nuclear membrane of an eukaryotic 
cell takes place also by a similar Brownia n ra tchet mech- 
anism through the nuclear pore complex |109l | . The elon- 
gation of a mRNA during transcription by a RNA poly- 
merase can also be a physical realization of the Brownian 
ratchet mechanism jllOl llll| . Translocation, one of the 
crucial steps of the mechano-chemical cycle of a ribosome 
is a physical realization of Brownian ratchet [65| . 



V. MATHEMATICAL DESCRIPTION OF 

MECHANO-CHEMICAL KINETICS: 

CONTINUOUS LANDSCAPES VS. DISCRETE 

NETWORKS 



We combine the fundamental principles of (stochas- 
tic) nano-mechanics and (stochastic) chemical kinetics to 
formulate the general theoretical framework for a quan- 
titative description of the mechano- chemistry or chemo- 
mechanics of molecular motors. We mention a few alter- 
native formalisms. 
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A. Motor kinetics as wandering in a 

time-independent mechano-chemical free-energy 

landscape 



This approach is useful for an intuitive physical ex- 
planation of the coupled mechano-chemical kinetics of 
molecular motors. At least one of the independent coor- 
dinate axes of this landscape represents the position of 
the motor in real space (i.e., its "mechanical coordinate") 
while at least another represents its chemical state (.e., 
"chemical coordinate" ) . Therefore, the minimum dimen- 
sion of this "land" is 2 and the free energy can be rep- 
resented by the hight at each point on the "land" . Both 
the position and chemical state variables are assumed 
to be continuous. The profiles of the free energy along 
both the position and chemical coordinates, obtained by 
taking appropriate cross sections of the free energy land- 
scape, are periodic with the same periodicity in both the 
directions. But, the profile is tiled forward along the 
chemical direction so that the bottom of the minima are 
located deeper in the forward direction along the "chem- 
ical" coordinate; this tilt accounts for the lowering of free 
energy caused, for example, by ATP hydrolysis. 



B. Motor kinetics as wandering in the 
time-dependent real-space potential landscape 



Consider those special situations where the chemical 
states of the motor are long lived and change in fast dis- 
crete jumps so that the position of the motor can con- 
tinue to change without alteration in its chemical state, 
except during chemical transitions when position remains 
frozen. Thus, no mixed mechano-chemical transition is 
allowed in this scenario. In such situations, we can as- 
sume that the potential landscape in real space remains 
unchanged for a while, during which the wanderings of 
the motor in this landscape (i.e., the positional dynamics 
of the motor) is governed by the "frozen" spatial profile 
of the potential. The profile of the potential switches se- 
quentially from one form to another and the sequence of 
m profiles is repeated in each cycle although the switch- 
ing times are random. Different profiles correspond to 
different motor-track interactions which is dependent on 
the nature of the ligand (if any) bound to the motor. 

In this formulation we assume that the allowed po- 
sitions of the motor on its track form a continuum x. 
The discrete index /x = l,2,...,Af denote the M dis- 
tinct spatial profiles of the potential V^{x) that are pos- 
tulated apriori. At any instant of time i, the state 
of the motor is given by {x,fj,). In the overdamped 
regime, the time-evolution of the position x( t) of the mo- 
tor obeys the Langevin equation (jlj with |112J | Fext = 
—dVfj,{x) / dx + Fioad- The time-dependence of the profile 



of the potential V^ {x) is governed by the master equation 
dPf^ix.t) 



dt 



= ^ P^, (x, i) W^M'^M (x) - 5] P^x, i)iy, 



M-^-M 



(x) 



where Pf^(x,t) is the probability that the motor protein 
"sees" the landscape Vfj_{x) at time t and Wfj,^fj,'{x) is the 
transition probability per unit time for a transition from 
the chemical state fi to the chemical state n' at position 

X. 

Instead of the Langevin equation, an equivalent 
Fokker-Planck equation can be used to describe the wan- 
dering of the motor protein in the potential energy land- 
scape Vfi{x). The advantage of this alternative formu- 
lation is that the Fokker-Planck equation can be com- 
bined naturally with the master equation that describes 
the time-evolution of the potential energy landscape. In 
this hybrid formulation, Pp(a;,t) represents the probabil- 
ity that, at time i, the motor is located at x, while "see- 
ing" the potential energy landscape V^{x). The equation 
governing the time dependence of Pf^{x,t) 



dP^ix,t) 
dt 



1_5_ 
rj dx 



{V'{x)~F}P^{x,t) 



kBT\d^Pf,ix,t) 



rj J dx^ 

Y,P^.[x,t)W^,,^,X^) 



Y^P^.[x,t)W^^^\x) 



(7) 



where 77 is a phenomenological coefficient that captures 
the effect of viscous drag. Note that there is no term 
in this equation which would correspond to a mixed 
mechano-chemical transition. 



C. Motor kinetics as a jump process in a fully 
discrete mechano-chemical network 



In this formulation, both the positions and "internal" 
(or "chemical" ) states of the motors are assumed to be 
discrete. Let P^{i,t) be the probability of finding the 
motor at the discrete position labelled by i and in the 
"chemical" state ^ at time t. Then, the master equation 
for Pf^{i,t) is given by 



(6) 



dP^Xht) 



fj.' p.' 

+ EI]^p'(j,Owp'-^p(j^*) 

(8) 






j^i m' 
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where the terms enclosed by the three different brackets 
[.] correspond to the purely mechanical, purely chemical 
and mechano-chemical transitions, respectively. 

As a concrete example, which will be used also for on 
several other occasions later in this review, consider a 2- 
state motor that, at any site j, can exist in one of the 
only two possible chemical states labelled by the symbols 
1 j and 2j . We assume the mechano-chemical cycle of this 
motor to be 



ratio 



li ^ 2j ^ lj+ 






i+i 



(9) 



where the rates of the allowed transitions are shown ex- 
plicitly above or below the corresponding arrow. Note 
that the transition Ij ^ 2j is purely chemical whereas 
the transition 2j — >■ Ij+i is a mixed mechano-chemical 
transition. The corresponding master equations are given 
by 



dPiii) 



dt 



^ '^2P2{l - 1) + ^-lP2{^ - ^iPlii) 



dP2{t) 
dt 



= UJlPl{i) - W_iP2(«) -t^2-P2(i) 



(10) 



Suppose a load force i^e^-t opposes the meacho-chemical 



transition 2, 



-^ 



Ij+i- The effect of the load force 



can be incorporated by assuming the load-dependence 
of the corresponding rate constant to be of the form 
i^2iFext) = ^2{0)exp[-Fext/ikBT)] whcrc 012(0) is the 
value of the rate constant in the absence of any external 
force. We'll see some implications of these equations in 
several specific contexts later in this article. 



D. Balance conditions for mechano-chemical 
kinetics: cycles, and flux 

On a discrete mechano-chemical state space, each state 
is denoted by a vertex and the direct transition from one 
state (denoted by, say, the vertex i) to another (denoted 
by, say, the vertex j) is represented by a directed edge 
\ij >. The opposite transition from j to i is denoted 
by the directed edge \ji >. A transition flux can be 
defined along any edge of this diagram. The forward 
transition flux J|y> from the vertex i to the vertex j is 
given by WjiPi while the reverse flux, i.e., transition flux 
J\jiy from j to i is given by WijPj. Therefore, the net 
transition flux in the direction from the vertex i to the 
vertex j is given by Jji = WjiPi — WijPj. 

A cycle in the kinetic diagram consists of at least three 
vertices. Each cycle C^ ca n be decomposed into two di- 
rected cycles (or, dicycles) fllS] C?f where d = ± corre- 
sponds the clockwise and counter-clockwise cycles. The 
net cycle flux J{C^) in the cycle C^, in the CW direction, 
is given by J(C^) = J[C+) - J[C-). 

For each arbitrary dicycle C?f, let us define the dicycle 



net) = n 



<ij>eC+ 



(11) 

where the superscript /i, d on the product sign denote a 
product evaluated over the directed edges of the cycle. 
So, by definition, n{C-) = l/7^(C'+) 

It has been proved rigorously [il4| that, for detailed 
balance to hold, the necessary and sufficient condition to 
be satisfied by the transition probabilities is 



7^(C;^) = 1 for aU dicycles C^, 



fl2) 



For a no n-equilibrium steady state (NESS), one can 
define [113J | the dicycle frequency r2**(CfJ) which is the 
number of dicycles C^ completed per unit time in the 
NESS of the system. Then, 

f}-(c;)/f7-(c;) = u^^^ {w,,/w,j) = 7^(c•^) (i3) 

Clearly, n{Cf^) 7^ 1, in general, for any NESS. 

Detailed balance is believed to be a property of systems 
in equilibrium whereas the conditions under which molec- 
ular machines operate are far from equilibrium. Does it 
imply that detailed balance breaks down for molecular 
machines? The co rrect answ er this subtle question needs 
a careful analysis |ll5l4ll7 |. 

If one naively assumes that the entire system returns to 
its original initial state at the end of each cycle one would 
get the erroneous result that the detailed balance breaks 
down. But, strictly speaking, the free energy of the full 
system gets lowered by \5G\ (e.g., because of the hydrol- 
ysis of ATP) in each cycle although the cyclic machine 
itself comes back to the same state. When the latte r fact 
is incorporated correctly in the analysis 115l4ll7 |. one 
finds that detailed balance is not violated by molecular 
machines. This should not sound surprising- the transi- 
tion rates "do not know" whether or not the system has 
been driven out of equilibrium by pumping energy into 
it. 



VI. SOLVING FORWARD PROBLEM BY 

STOCHASTIC PROCESS MODELING: FROM 

MODEL TO DATA 

A. Average speed and load-velocity relation 

For simplicity, let us consider the kinetic scheme shown 
in fig. (HJa)). In terms of the Fourier transform 



P^(fc,t)= £ P,{x„t)e-^^^ 



(14) 



j = -oo 



of P^ {xj , t) , the master equations can be written as a 
matrix equation 



9P(fc,t) 
di 



W(fc)P(fc,i) 



(15) 
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(a) 



Chem. 
State 




(b) 
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Position 

FIG. 1: Three examples of different types of networks of dis- 
crete mechano-chemical states. The bullets represent the dis- 
tinct states and the arrows denote the allowed transitions be- 
tween the two corresponding states. The scheme is (a) is 
unbrached whereas that in (b) has branched pathways con- 
necting the same pair of states. The mechanical step size is 
unique in both (a) and (b) whereas steps of mor e tha n one 
size are possible in (c). (Adapted from fig.l of ref. [l20l ]'). 



where P is a column vector of M components {fi = 
1, ...M) and W(fc) is the transition matrix in the fc-space 
(i.e., Fourier space). Summing over the hidden chemical 
states, we get 



M 



M 



p{k,t) = Y,p^{k,t)^Y. E p^^i^i^ty 



■ikxj 



(16) 



Taking derivativ es of both sides of ([16]) with respect to q 

we get [nl [ni 






.f dP{k,t) 
\ dk 

dP 

fe=o ^ dk 



fc=0 
2 



fc=0 



< x{t) > 

<x'^{t) > - < x{t) >^ 
(17) 



Evaluating P(k,t), in principle, the stationary drift ve- 
locity (i.e., asymptotic mean velocity) V and the corre- 
sponsding diffusion constant D can be obtained from 



V — linit- 



d 



( dP{k,t) 
\ dk 



fe=0 






d^P{k,t) 

9fc2 



/'dp{k,t)y " 

fc=o V ^^ /fc=o. 


(18) 



It may be tempting to attempt a direct utilization of 
the general form 



P(fc,t) = ^i3, 



,A^(fe)t 



(19) 



where the coefficients -B^^ are fixed by the initial condi- 
tions and Xfj.{k) are the eigenvalues of W(A:). However, 
for the practical implementation of this method analyti- 
cally the main hurdle would be to get all the eigenvalues 
of W(fc). Fortunately, only the smallest eigenvalue Xmim 
which dominates Pik) in the limit t — > oo, is required for 

r 1 I ~rii 

evaluating V and D [1181 . I119J : 



V = 
D = 



■ I ClAminyk, t) 



V dk 
l/d^X. 
"2 



fc=0 

.{k,t) 



dB 



(20) 



fc=0 



Even the forms (|20|) are not convenient for evaluating 
V and D. Most convenient approach is based on the 
characteristic polynomial Q{k) associated with the ma- 
trix W(fc), i.e., Q(fc;A) = det[\l - W(fc)]. Therefore, 
Xmin{k) is a root of the polynomial Q{k; A), i.e., solution 
of the equation 



M 



g(fc;A) = ^C^(fc)[A(fc)]^-0. 



(21) 



Hence [TTl [TT^ 
V = 
D = 



Ci(0) 
Ci; - 2iC[{0)V - 2C2{0)V^ 



2Ci(0) 



(22) 



M=i 



fj,= lj = -00 



where C' = [dC^ik)/dk]k=o- 
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For a postulated kinetic scheme, W is given. Then 
the expressions (P^ are adequate for analytical deriva- 
tion of V and D for the given model. However, in order 
to calculate the distributions of the dwell times of the 
motors, it is more convenient to work with the Fourier- 
Laplace transform, rather than the Fourier transform, of 
the probability densities. Therefore, we now derive al- 
ternative expressions for V and D in terms of the full 
Fourier-Laplace transform of the probability density. 

Taking Laplace transform of ^^ with respect to time 



P^iKs) 



Pf,{k,t)e- 



(23) 



the matrix form of the master equation reduces to 
P{k,s) = R(fc,s)"ip(0) where R(fc, s) = si - W(fc) 
and P(0) is the column vector of initial probabilities. 
Now the characteristic polynomial is th e determinant of 
R(fc, s) which can be expressed as |l2Cl| 



|R(fc,s) 



M 



•Ak)s>^ 



0. 



(24) 



Equati on (l24l) 
we get jl20l | 

V 

D 



is formally similar to ([2T|) . As expected. 



ci(0) 



2c2(0)T/2 



2ci(0) 



(25) 



•Example: an M-step unbranched mechano- 
chemical cycle 

As an illustrative example, let us consider the un- 
branched mechano-chemical cycle [SJ with M = 4, as 
shown in figl2] This special value of M is motivated by 
the typical example of a kinesin motor for which the four 
essential steps in each cycle are as follows: (i) a substrate- 
binding step (e.g., binding of an ATP molecule), (ii) a 
chemical reaction step (e.g., hydrolysis of ATP), (iii) a 
product- release step (e.g., release of ADP), and (iv) a 
mechanical step (e.g., power stroke). 




FIG. 2: An unbranched mechano-chemical cycle of the molec- 
ular motor with M = 4. The horizontal dashed line shows the 
lattice which represents the track; j and j + 1 represent two 
successive binding sites of the motor. The circles labelled by 
integers denote different "chemic al" s tates in between j and 
j + 1. (Adapted from fig.7 of ref.[l23|). 

Suppose, The forward transitions take place at rates 
Ui whereas the backward transitions occur with the rates 



Wj . The average velocity V of the motor is given by [a| 



V = 



1 



R 



M 



J=0 



where 



with 



R 



M 



1 



M-1 
J=0 



M-1 k 

En 

fc=l i=l 



Uj+i 



while D is given by 

'{VSM + dUM) {M + 2)V' 



D = 



i?2 



M 



where 



and 



while, 



M-1 M-1 

Sm = E *J E ('^ + l)'''=+i+i 

3=0 



k=0 



M-1 
j=0 



M-1 k 

En 



t'j+i-i 



^j-i 



(26) 



(27) 



(28) 



(29) 



(30) 



(31) 



(32) 



fe=i 1=1 

For various extensions of this scheme see ref . Q . 

In the simpler case shown in (|9]), where M = 2, and 
the second step is purely irreversible, using the step size 
£ explicitly (to make the dimensions of the expressions 
explicitly clear), we get 



V 



UJlU!2 



-? 



Wi -I- W_i + UJ2 



LOl + W-l + L02 



(33) 



Note that if, in addition, a;_i vanishes, i.e., if both the 

steps are fully irreversible, then d/V — wf"^ -|-w^^, i.e., 

the average time taken to move forward by one site is the 

sum of the mean residence time in the two steps of the 

cycle. 

•EfTects of branched pathways and inhomo- 

geneities of tracks 

Microtubule- associated proteins and actin-related pro- 
teins can give rise to inhomogeneities of the tracks for 
cytoskeletal motors. The intrinsic sequence inhomogene- 
ity of DNA and RNA strands can nontrivial influences on 
the motors that use nucleic acid strands as their tracks 
[122 
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B. Jamming on a crowded track: flux-density 
relation 



In the preceeding subsection we considered lone motor 
moving along a filamentous track. Now we consider a 
track with heavy motor traffic. In principle, unless con- 
trolled by some regulatory signals, formation of "traffic 
jams" fl23l- ll25l | on crowded tracks cannot be ruled out 
[l26 129]. These phenomena are formally similar to sys- 
tems of sterically interacting self-propelled particles. One 
of the simplest theoretical models for such systems is the 
totally asymmetric simple exclusion process (TASEP). 
In fact, TASEP was originally inspired by traffic of ri - 
bosomes on a mRNA track during translation |,130l4l33| . 
However, in recent years it has found application also in 
the context of traffic-like collective movements of ribo- 
somes in more realistic models ;134. 135] as we ll as many 
other types of molecular motors, e.g., kinesins J103lll04| - 
RNA polymerase |l36l . Il37| . gro wth of fungal hyphae 
[138] , growth of bacterial flagella [l39j | , crowd ing o f de- 
polymerizing kinesins at the tip of a microtube [140j, etc. 



Information processing machines: fidelity versus 
power and eflficiency 



Power and efficiency are most appropriate quantities to 
characterize the performance of porters, rowers, sliders, 
etc. But, not all machines fall in these categories. For 
information processing machines, which we consider in 
this subsection, fidelity of information transfer is more 
important than power and efficiency. 

The sequence of the monomeric subunits of a polynu- 
cleotide or that of a polypeptide is dictated by that of 
the corresponding template strand. During the polymer- 
ization process, after preliminary tentative selection, the 
selected monomeric subunit is subjected to several levels 
of checks by the quality control system of the polymeriz- 
ing machinery. Only after a selected subunit is screened 
by such a stringent test, it is covalently bonded to the 
elongating biopolymer. 

Discrimination of monomers based merely on the dif- 
ferences of free-energy gains cannot account for the ob- 
served high fidelity of these p rocesses. For example, "ki- 
netic proofreading" [l4llll42j | has been invoked to explain 
the kinetic processes that contribute to the high level of 
translational fidelity. Follow ing t hree features are essen- 
tial for kinetic proofreading [l43| : 
(i) formation of an initial enzyme-reactant complex, 
(ii) a strongly forward driven step that results in a high- 
energy intermediate complex, and 

(iii) one or more branched pathways along which dissocia- 
tion of the enzyme-reactant complex, and rejection of the 
reactant, is possible before the complex gets an opportu- 
nity to make the final transition to yield the product. 



Let is consider the kinetic scheme shown below 



M, 



Pn + S^ h 



h 

4. 



M 



j+i 



Pn^ 



(34) 



where Mj denotes the polymerase motor located at the 
discrete position j on its template, P„ represents the 
elongating biopolymer consisting of n subunits and S is 
a single subunit. Note that the scheme shown in ([M)) 
is, at least, formally an extension of ([9]) where an extra 
intermediate state I2 and a branched path from I2 have 
been added. This scheme is one of the simplest possible 
implementations of kinetic proofreading. 

Kinetic proofreading leads to futile cycles in which at 
least part of the input free energy is dissipated with- 
out elongating the nascent polypeptide by an amino acid 
monomer. The eff ects of the consequent loose mechano- 
chemical coupling |144l | of the translational machinery on 
the rate of po lypeptide synthesis has been investigated 

Occasionally wrong monomers escape detection in 
spite of elaborate selection procedure. In case of DNA 
replication, the polymerase normally detects the error 
immediately and corrects it before elongating it further. 
For this purpose, the nascent DNA is transferred to an- 
other specific site on the same polymerase where the 
wrong monomer is cleaved before transferring the nascent 
DNA back to the site of elongation activity. The inter- 
play of the two contradictory activities of elongation and 
shortening of a nascent DNA exposes leads to the cou- 
pling of their corresponding rates in t he overall rate of 
DNA polymerization by a DdDP [l47 [. 



D. Beyond average: dw^ell time distribution (DTD) 

Two motors with identical average velocities may ex- 
hibits widely different types of fluctuations. Suppose the 
successive mechanical steps are taken by the motor at 
times ti, t2, ...,i„_i,in,in-i-i, •■•■ Then, the time of dwell 
before the fc-th step is defined by Tk ~ tk — tk-i- In 
between successive steps, the motor may visit several 
"chemical" states and each state may be visited more 
than once. But, the purely chemical transitions would 
not be visible in a mechanical experimental set up that 
records only its position. The number of visits to a given 
state and the duration of stay in a state in a given visit 
are random quantities. 

In order to appreciate the origin of the fiuctions in the 
dwell times, let us consider the simple iV-step kinetics: 



Ml ^ M2 ^ M3... ^ M, 



M 



N 



(35) 



Suppose t^ ,y is the duration of stay of the motor in the 
/i-th state during its i^-th visit to this state. If r is the 
dwell time, then 



N n^ 

p,— 1 1^— 1 



(36) 
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where ri^; is the number of visits to the /i-th state. It is 
straightforward to check that 

N 

<T > = ^ < n^ >< i^ > 

(37) 

where < n^ > is the avarege number of visitits to the 
/i-th state and < t^ > is the avarege time of dweh i n th e 
/i-th state is a single visit to it. More interestingly |l48l |. 

N 
<t'^ > - <T >^= ^(< tl> - <t^ >2) < n^ > 



M=l 



N 



^(<n2 >-<n^>2)<i^>2 



f,=l 



2 ^ (< rif.n^ > - < n^ >< n^ >) < t^ >< t^ > 



(38) 



^i,>v 



The first and second terms on the right hand side of i 
capture, respectively, the fluctuations in the lifetimes of 
the individual states and that in the number of visits 
to a kinetic state. Note that the number of visits to a 
particular state depends on the number of visits to the 
neighboring states on the kinetic diagram; this interstate 
correlation is captured by the third term on the right 
hand side of (|38l). 

Several different analytical and numerical techniques 
have been developed for calculation of the dwell time 
distribution [120, 149 15l| . Sin ce the dwell time is essen- 
tially a first passage time [l52j . an absorbing boundary 
method Il50l has been used. 



1 . D TD for a motor that never steps backward 

As an example, we consider again the simple scheme 
dH). In this case, the DTD is 



fit) 



UJ1UJ2 



{e- 



') 



(39) 



where 



Wl + W_l +UJ2 , 
L0± = ± 



{UJI +UJ-1 +UJ2Y 



UJIUJ2 



(40) 

In the special case uj-i — 0, aj+ = wi and a;_ = lj2 and, 
hence. 



fit) 



UJ1UJ2 
U02 ~ 1^1 



ie 



-UJit 



-UJ2t 



(41) 



Similar sum of exponentials for DTD have been derived 
also for machines with more c omplex rnechan o-chemical 
kinetics (see, for example, refs. [l2ll Il46l . Il53| ) . 



It is possible to establish on general grounds that, for a 
motor with N rnechano-chemical kinetic states like (|35l) , 
the DTD is a sum of n exponentials of the form J148J | 



N 



/(t) = ^Qe 



(42) 



where A^ — 1 of the N coefficients Cj (1 < j < N) are 
independent of each other because of the constraint im- 
posed by the normalization condition for the distribution 
fit). Also note that the prefactors Cj can be both posi- 
tive or negative while uij > for all j . 

If we consider an even more special case of the scheme 
© where w_i — 0,uJi = uj2 = ^, i-O-, all the steps are 
completely irreversible and take place at the same rate 
to, the DTD becomes the Gamma-distribution 



fit) = 



^N^N-l^-u^t 



T{N) 



(43) 



where r(A'^) is the gamma function with N — 2. In- 
terestingly, for the Gamma-distribution, the rand omness 
parameter |154i] (also called the Fano factor |l55J ]') 



(< r^ > - < T >^ 



(44) 



is exactly given by r = 1/N. Can the experimentally 
measured DTD be used to determine the number of states 
A^? Unfortunately, for real motors, (i) not each step of 
a cycle is fully irreversible, (ii) the rate constants for dif- 
ferent steps are not necessarily identical, (iii) branched 
pathways are quite common. Consequently, l/r may pro- 
vide just a bound on the rough estimate of A^. 

Can one use the general form (|^^ of DTD to extract 
all the rate constants for the kinetic model by fitting it 
with the experimentally measured DTD? The answer is: 
NO. First, even if a good estimate of A^ is available, the 
number of parameters that can be extracted by fitting 
the experimental DTD data to ([42)1 is 2A^ — 1 (n values 
of ujj and A^ — 1 values of Cj). On the other hand, the 
number of possible rate constants may be much higher 
[l48J |. For example, if transitions from every kinetic state 
to every other kinetic state is allowed, the total number 
of rate constants would be A^(A^ — 1). In other words, in 
general, the kinetic rate constants are underspecified by 
the DTD. Second, as the expression ^^ for the DTD of 
the example ^ shows expplicitly, the w's that appear in 
the exponentials may be combinations of the rate con- 
stants for the distinct transitions in the kinetic model. 
It is practically impossible to extract the individual rate 
constants from the estimated w's unless any explicit rela- 
tion like (PD|) between the estimated uj's and actual rate 
constants is apriori available. 



2. Conditional DTD for motors with both forward and 
backward stepping 

For motors which can step both forward and backward, 
more relevant information on the kinetics of a motor are 
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contained in the conditional dwell time distribution [15; 
If n_|_ and n_ are the numbers of forward and backward 
steps, respectively, then for large n = n+ + n_, the cor- 
responding step splitting probabilities are 11+ = n^/n 
and n_ = u-jn. The dwell times before a forward step 
and before a backward step can be measured separately. 
Hence, the prior dwell times t^ and r_ can be obtained 
by restricting the summations in 






Tk 



(45) 



to just forward (+) or just backward (-) steps, respec- 
tively. In terms of splitting probabilities and prior dwell 
times, the mean dwell time (r) can be expressed as 



(r) = Ii+r\ 



n_ 



(46) 



Compared to the prior dwell times, more detailed in- 
formation on the stepping statistics is contained in the 
four conditional dwell times, which are defined as follows: 






= dwell time between a 
= dwell time between a 
= dwell time between a 
= dwell time between a 



step followed by a 
step followed by a 



X-ray diffraction and electron microscopy, two of the 
most powerful biophysical techniques, yield the structure 
of molecular machines averaged over an ensemble. Elec- 
tron microscopy is a powerful alternative for the deter- 
mination of structures of those macromolecules whose 
cry stals are not available. Cryo-electron microscopy 
[l|, Il58l | combines the technique of electron microscopy 
with cryogenics-based sample prep aration. 

The single-molecule techniques J159l - ll64j can be fur- 
ther classified into two groups: (i) methods of imaging, 
and (ii) methods of manipulation. These are not mutu- 
ally exclusive and can be probed by a single experimental 
set up. The most direct single-molecule imaging is car- 
ried out by fluorescence-based optical microscopy. Tra- 
ditional diffraction-limited optical microscopy provides a 
"hazy" image of the machine. However, modern opti- 
cal nanoscopy has broken the resolution limit imposed 
by diffraction 'l65j. Fluorescence microscopy provides a 
glimpse (howsoever hazy) of single molecules. Imaging 
a fluorescently labelled molecular machine in real time 
stcpenables us to study its dynamics just as ecologists use 
step "radio collars" to track individual animals. 



step followed by a -I- step 



The techniques developed for the mechanical manipu- 



step followed by a 



lation of a single machine can be classified on the basis 
Pof tranduccrs used; the table below provides a summary. 

(47) 



Mechanical manipulation of a single biomolecule 



It is 


helpful 


to 


introduce pairwise step 


probabilities 


n++, 


n+- 


,n- 


+ ,H__. Note that H++ 


+ H+_ = 1, and 


n-+ 


+ H_ 


__ = 


= 1. 


Neglecting finite time corrections. 


n++ 


= n++/{ 


n++ +n+-), H+_ = n+ 


-/(" 


++ + n+-) 


n-+ 


= n_ 


-+/( 


n \- + n ), H = n_ 


-/(" 


-++n—) 














(48) 








< 


= n++r+++H+_T_ 


-+ 










r^ 


= H_+r+_+H__r_ 





(49) 



Mechanical transducers 



\ 



Field-based t ransducers 



SFM Micro-needle 



EM-field 



Flow-field 



Electric field 



Ma gnetic field 



Defining the post dwell times r^ in a fashion similar to 
that used for defining the prior dwell times, we get 



n++r+H 

H_+r__ 



n+-r+_ 

H__r__ 



(50) 



and 



(r) = H+r; 



H 



This formalism has been applied, 
headedkinesin KIFIA Il57ll. 



_r- (51) 

for example, single- 



VII. A SUMMARY OF EXPERIMENTAL 

TECHNIQUES: ENSEMBLE VERSUS 

SINGLE-MACHINE 

The experimental techniques for probing the struc- 
ture and dynamics of molecular machines can be di- 
vided broadly into two groups: (i) ensemble-averaged, 
(ii) single-machine. 



VIII. SOLVING INVERSE PROBLEM BY 

PROBABILISTIC REVERSE ENGINEERING: 

FROM DATA TO MODEL 

A discrete kinetic model of a molecular motor can be 
regarded as a network where each node represents a dis- 
tinct mechano-chemical state. The directed links denote 
the allowed transitions. Therefore, such a model is unam- 
biguously specified in terms of the following parameters: 
(i) the total number N of the nodes, (ii) the N x N ma- 
trix whose elements are the rates of the transitions among 
these states; a vanishing rate indicates a forbidden direct 
transition. 

In the preceeding sections we handled the "forward 
problem" by starting with a model that is formulated 
on the basis of apriori hypotheses which are, essen- 
tially, educated guess as to the mechano-chemical kinet- 
ics of the motor. Standard theoretical treatments of the 
model yields data on various aspects on the modelled 
motor; this approach is expressed below schematically. 



Theoretical model 



Expcrinrcntal data 



Consistency between theoretical prediction and experme- 
ntal data validates the model. However, any inconsis- 
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tency between the two indicates a need to modify the 
model. 

In most real situations the numerical values of the rate 
constants of the kinetic model are not known apriori. In 
principle, these can be extracted by analyzing the exper- 
imental data in the light of the model. However, not only 
the rate constants but also the number of states and the 
overall architecture of the mechano-chemical network as 
well as the kinetic scheme postulated by the model may 
be uncertain. In that case, the experimental data should 
be utilized to "select" the most appropriate model from 
among the plausible ones. In fact, more than one model, 
based on different hypotheses, may appear to be consis- 
tent with the same set of experimental data within a level 
of accuracy. The experimental data can be exploited at 
least to "rank" the models in the order of their success 
in accounting for the same data set. 

The "inverse problem" of inferring the model from the 
observed experimental data has to be based on the the- 
ory of probability. Such "statistical inference" |166J can 
be drawn by following methods developed by statisti- 
cians over the last one century. Inferring the complete 
network of mechano-chemical states and kinetic scheme 
of a molecular machine from its observed properties is 
reminiscent of inferring the operational mechanism of a 
given functioning macroscopic machine by "reverse en- 
gineering" . This inverse problem, which is the main 
aim of this section, is expressed below schematically. 



Theoretical model 



Experimental data 



It is worth emphasizing that both the directions of in- 
vestigations, i.e. the forward problem and the inverse 
problem ar e eq ually important and complementary to 
each other [l67J |. 



A. Frequentist versus Bayesian approach 

Suppose, m be a column vector whose M compo- 
nents are the M parameters of the model, i.e., the trans- 
pose of TO is TO — (to,i, ?Ti2, ..., to,m) Let the data ob- 



tained in N observations of this model are represented 
by the A^-component column vector d whose transpose 
is d = {di,d2, ■■■,dN)- Our "inverse problem" is to in- 
fer information on to from the observed d. The philoso- 
phy underlying the frequentist approach, i.e., approaches 
based on maximum-likelihood (ML) estimation and the 
Bayesian approach for extracting these information are 
different in spirit, as we explain in the next two subsub- 
sections 168]. 

For simplicity, let us assume that a device has only two 
possible distinct states denoted by £i and £2- 



£1 ^^£2 

kr 



(52) 



Let us imagine that we are given the actual sequence of 
the states, over the time interval < i < T, generated 



by the Markovian kinetics of the device. But, the mag- 
nitudes of the rate constants kf and k^. are not supplied. 
We'l l now formulate a method, based on ML analysis 
|169l | to extract the numerical values of the parameters 
kf and kr- 

Suppose t, and t, denote the time interval of the j- 
th residence of the device in states £1 and £2, respectively. 
Moreover, suppose that the device makes Ni and N2 vis- 
its to the states £1 and £"2, respectively, and N — N1+N2 
is the total number of states in the sequence. Therefore, 
total time of dwell in the two states are Ti = X^jii 4 

and T2 = Ef=i tf where Ti+T2= T. 

Since the dwell times are exponentially distributed for 
a Poisson process, the likelihood of any state trajectory 
S is the conditional probability 



P{S\kf,kr) 



nfi,fc,e-^/*. 



tA'2 



^^7 — 1 >^r^ 



.„/2) 



-fcrt 



fcfe-^/^M(fc^e-'=^^^ 



Maximum-likelihood estimate 



(53) 



ML approcah |17Cl| is based on finding the estimates 
of the set of model parameters that corresponds to the 
maximum of the likelihood P{d\rn) for a fixed set of data 
d. 

For the kinetic scheme shown in equation ([5^ , the the 
ML estimates of kf and kr are obtained by using ([53)) 
in d[lnP{S\kf,kr)]/dkf = = d[lnP{S\kf,kr)]/dkr. It 
is straightforward to see |l69l | that these estimates are 
kf = Ni/Ti and kr = N2/T2. 



2. Bayesian estim,ate 

For drawing statistical inference regarding a kinetic 
model, the Bayesian ap proach ha s gained increasing pop- 
17ll - ll76{ . The areas of research 



ml 

1811, 



ularity in recent years 

where this has been applied successfully inclu de variou s 

biological pro cess es in, for example, gene tics [l77l 

biochemistry |l79l | , cognitive sciences [180| , ecology 

etc. 

In the Bayesian method there is no logical distinc- 
tion between the model parameters and the experimental 
data; in fact, both are regarded as random. The only dis- 
tinction between these two types of random variables is 
that the data are observed variables whereas the model 
parameters are unobserved. The problem is to estimate 
the probability distribution of the model parameters from 
the distributions of the observed data. 
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The Bayes theorem states that 

P{d\m,)P{fn) 



P(m|i 



Pid) 



where P{d) can be expressed as 

P{d) = / P{d\m}P{m)dm 



(54) 



(55) 



The likelihood P{d\m) is the conditional probability for 
the observed data, given a set of particular values of 
the model parameters, that is predicted by the kinetic 
model. However, implementation of this scheme also re- 
quires P{m) as input. In Bayesian terminology P{rn) 
is called the prior because this probability is assumed 
apriori by the analyzer before the outcomes of the exper- 
iments have been analyzed. In contrast, the left hand 
side of equation (|54l) gives the posterior probability, i.e., 
after analyzing the data. 

Thus, an experimenter learns from the Bayesian anal- 
ysis of the data. Such a learning begins with an input 
in the form of a prior probability; the choice of the prior 
can be based on physical intuition, or general arguments 
based, for example, on symmetries. Prior choice can be- 
come simple if some experience have been gained from 
previous measurements. Often an uniform distribution 
of the model parameter (s) is assumed over its allowed 
range if no additional information is available to bias its 
choice. To summarize, Bayesian analysis needs not just 
the likelihood P{d\rn) but also the prior P{rn). 

For the kinetic scheme shown in equation ([52]) . the 
Bayes' theorem ([M]) takes the form 

P{S\kf,kr)P{kf,kr) 
P{kf,kr\S) = — ' ' 



P(5) 
P{S\kf,kr)P{kf,kr) 



(56) 



We now assume a uniform prior, i.e., constant for positive 
kf and kr, but zero otherwise. Then, P{kf,kr\S_) is pro- 
portional to the likelihood function P{S\kf,kr) (within 
a normalization factor). Normalizing, we get 

P{kf,kr\S) 



rpN 1 + 1 

T{Ni + 1) 



fcfe-'^^^i 



rpN2 + l 

^2 k^^t 
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on the recording of the position of the motor in a single 
motor experiment. This problem is similar t o an older 
problem in cell biology: ion-channel kinetics (l82l Il83j | . 
Current passes through the channel only when it is in 
the "open" state. However, if the channel has more than 
one distinct closed states, the recordings of the current 
reveals neither the actual closed state in which the chan- 
nel was nor the transitions between those closed states 
when no current was recorded. 



-k^T2 



(57) 

The mean of kf is A^i -I- 1)/Ti whereas the most-likely es- 
timate is Ni/Ti. Similarly, the mean and most probable 
estimates of kr are obtained by replacing the subscripts 
1 by subscripts 2. Moreover, the variance of kf and kr 
are (A^i -f l)/T^ and (iVa + l)/r|, respectively. 



3. Hidden Markov Models 

The actual sequence of states of the motor, generated 
by the underlying Markovian kinetics, is not directly visi- 
ble. For example, a sequence of states that differ "chem- 
ically" but not mechanically do not appear as distinct 



Hidden Markov Model (HMM) fl84l-[l87ll ha s been 
applied to analyze FRET trajector ies llSSl Il89l| , step- 
ping recordings of molecula r mo tors [iQCt Il9l| , and acto- 
myosin contractile system [l92J | to extract kinetic infor- 
mation. 

For a pedagogical presentation of the main ideas be- 
hind HMM, we start with the kinetic scheme shown in 
(I54p and a simple, albeit unrealistic, situation and then 
by gradually adding more and more realistic feature s, ex - 
plain the main concepts in a transparent manner [l69|. 
First, suppose that the actual sequence of states (trajec- 
tory) itself is visible; this case can be analyzed either by 
the ML-analysis of by Bayesian approach both of which 
we have presented above. We now relax the strong as- 
sumption about the trajectory and proceed as below. 

•If state trajectory is hidden and visible trajec- 
tory is noise-free 

The sequence of states of the device is, as before, gen- 
erated by a Markov processs which is hidden. Suppose 
the device emits photons from time to time that are de- 
tected by appropriate photo-detectors. For simplicity, we 
assume just two detection channels labelled by 1 and 2. 
For the time being, we also assume a perfect one-to-one 
correspondence between the state of the light emitting 
device and the channel that detects the photon. If the 
channel 1 (2) clicks then the light emitting device was 
in the state £i {£2) at the time of emission. The inter- 
val Atj — tj^i — tj between the arrival of the j'-th and 
i ~V l-th photons (1 < j < N) is random. 

Thus, from the photo-detectors we get a visible se- 
quence of the channel index (a sequence made of a bi- 
nary alpha bet) which we call "noiseless photon trajec- 
tory" jl69l | . The sequence of states in the noiseless photo 
trajectory is also another Markov chain that is conven- 
tionally referred to as the "random telegraph process" . 
Note that the photon detected by channel 1 (or, channel 
2) can take place at any instant during the dwell of the 
device in state 1 (or, state 2). Therefore, the sequence 
of states in the noiseless photon trajectory does not re- 
veal the actual instants of transition from one state of 
the device to another. 

Since the noiseless phton trajectory corresponds to a 
random telegraph process, the transition probabilities for 
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this process are 

Pi£i\ £2;kf,kr,Atj ) = 

P{£2\ £i;kf,kr,Atj ) = 

P{£2\£2;kf,kr,Atj) = 









y 



,-(fc/ + fe^)Atji 



_p-(fc/ + fer)Aijl 



/v ^' n~ f£>^ Pb I' ~\~ Kij- 



where P(£^\£^] kf, kr, Atj) is the conditional probabihty 
that state of the device is £^ given that it was in the state 
£,^ at a time At earher. 

The hkehhood of the visible data sequence {V} is now 
given by 

Pi{V}\kf,kr) = PiVl\kf,kr)UfS^^PiVj + l\Vj;kf,kr,Atj) 

(59) 
where the first factor on the right hand side is the ini- 
tial probability (usually assumed to be the equilibrium 
probability). The transition probabilities on the right 
hand side of equation (j59p are the conditional probabili- 
ties given in equation (|58l) . Unlike the previous simpler 
case, where the state sequence itself was visible, no ana- 
lytical closed-form solution is possible in this case. Nev- 
ertheless, analysis can be carried out numerically. 
•If state trajectory is hidden and visible trajec- 
tory is noisy 

In the preceeding case of a noise- free photon trajectory, 
we assumed that from the channel index we could get 
perfect knowledge of the state of the emitting device. 
More precisely, the conditional probabilities were 



Thus, in this case, the relation between the states of the 
hidden and visible states is not one-to-one, but one-to- 

-(kf+kr)Atj many. Therefore, given a hidden state of the device, a set 
of "emission probabilities" determine the probability of 
each possible observable state; these are listed in equa- 
tions ([CTjl for the device (|5^ . 

•HMM: formulation for a generic model of molec- 
ular motor 

On the basis of the simple example of a 2-state sys- 

-{kf+kr)Atj tem presented above, we conclude that, for data analysis 
based on a HMM four key ingredients have to be speci- 
(58) fied: 

(i) the alphabet of the "visible" sequence {fj.} (1 < ^ < 
N), i.e., N possible distinct visible states; (ii) the alpha- 
bet of the "hidden" Markov sequence {j} (1 < j < M), 
i.e., M allowed distinct hidden states, (iii) the hidden- 
to-hidden transition probabilities W{j -^ fc), and (iv) 
hiddcn-to-visible emission probabilities _B(j -^ /i). In ad- 
dition to the transition probabilities and emission proba- 
bilities, which are the parameters of the model, the HMM 
also needs the initial state of the hidden variable (s) as in- 
put parameters. 

Suppose P{{V}\ HMM,{X} ) denotes the probability 
that an HMM with parameters {A} generates a visible 
sequence {V}. Then, 



P(M£i) = 


= 1 


^(11^2) = 


= 


F(2|fi) = 


= 


^(2|f2) = 


= 1 



However, in reality, background noise is unavoidable. 
Therefore, if a photon is detected by the channel 1, it 
could have been emitted by the device in its state £i 
(i.e., it is, indeed, a signal photon) or it could have come 
from the background (i.e., it is a noise photon). Suppose 
Ps is the probability that the detected photon is really a 
signal that has come from the emitting device. Suppose 
Pbi is the probability of arrival of a background photon in 
the channel 1 . The probability that a back ground photon 
arrives in channel 2 is 1 — pbi . Then |169| , 

E{1\££) 
E{2\£^) 
Eim) 
E(2\£2) 



Ps + {1- Ps)Pbl 

1-P{l\£i) = il-P,){l-Pbi) 

(1 -Ps)Pbl 

l-P{l\£2)^Ps + {l-Ps){l-Pb) 



Visible: 
Hidden: 
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P{{V}\HMM,{X}) 



{H} 



P{{V}\ {H};{X} )P{{H}\{\}) 

(62) 
where P{{H}\{X}) is the conditional probability that the 
HMM generates a hidden sequence {H} for the given 
parameters {A} and P({y}|{i/}; {A}) is the conditional 
probability that, given the hidden sequence {H} (for pa- 
rameters {A}) the visible sequence {V} would be ob- 
tained. 

Once P{{V}\HMM,{X}) is computed, the parame- 
(60) ter set {A} are varied to maximize P{{V}\HMM,{X}) 



(61) 



(for the convenience of numerical computation, often 
\nP{{V}\HMM,{X}) is maximized. The total number 

of possible hidden sequences of length T is T^'^^ . In 
order to carry out the summation in equation (|62p one 
has to enumerate all possible hidden sequences and the 
corresponding probabilities of occurrences. A successful 
implementation of the HMM requires use of an efficienct 
numerical algorithm; the Viterbi algorithm is one such 
candidate. 

In case of a molecular motor, the "chemical states" 
are not visible in a single molecule experiment. More- 
over, even its mechanical state that is "visible" in the 
recordings may not be its true position because of (a) 
measurement noise, and (b) steps missed by the detector. 
Let is denote the "visible" sequence by the recorded posi- 
tions {Y} whereas the hidden sequence is the composite 
mechano-chemical states {X, C} where X and C denote 
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the true position and chemical state, respectively. The In this case, 

transition probabilities are denoted by W{Xt-i,Ct-i -^ 

Xt, Cf) O ne possible choice for the emission probabilities x-^ 

sLm Pi{Y}\ HMM,{X} )= y^ P{{Y}\ {X,C};{X} )P{{X,C}\{X}) 

{X,C} 

I i (Y -x)^ C^^) 

E{Xt ^ Yt) - Jj^^-^exp[^ ^ \^ ,;^ ] (63) where 



(27ra2) 



(2^2) 



P({X, C}|{A}) = Px„,c„W^(^o, Co ^ Xi, Ci)W^(Xi, Ci -^ ^2, C2) W^(Xt-i, Ct-1 -> X^, Ct) (65) 



and 



P{{Y}\ {X,C};{\] ) = EiXo ^ Yo)EiXi ^ yi)...^(Xt ^ y^-^l^T ^ >t) 

I 



(66) 



The usual strategy |188l [ISO!] consists of the following 
steps: Step I: Initialization of the parameter values for it- 
eration. Step II: Iterative re- estimation oi parameters for 
maximum likelihood: the parameters {W{Xt-i,Ct-i -^ 
Xt,Ct)}, {E{Xt — !> Yt)} andPxo^Co are re-estimated iter- 
atively till P{{Y}\IIMM, {A}) saturates to a maximum. 

Step III: Construction of "idealized" trace: using the 
final estimation of the model parameters, the position of 
the motor as a function of time is reconstructed; natu- 
rally, this trace is noise-free. Step IV: Extraction of the 
distributions of step sizes and dwell times: from the ideal- 
ized trace, the distributions of the steps sizes dwell times 
are obtained. These distributions can be compared with 
the corresponding theoretical predictions. 



chemical states. It can be shown 193] that 
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(68) 

Based on th is observation, a prescription has been sug- 
gested [l93| to extract 0(a;) by analyzing the time se- 
ries of motor positions obtained from single-motor ex- 
periments. 



IX. OVERLAPPING RESEARCH AREAS 

A. Symmetry breaking: directed motility and cell 
polarity 



B. Extracting FP- based model from data? 

In this section so far we have discussed methods for ex- 
tracting the master-equation based models that describe 
the kinetics of motors in terms of discrete jumps on a fully 
discrete mechano-chemical state space. However, as we 
summarized in section [Vj kinetics of molecular motors 
can be formulated also in terms of FP equations which 
treat space as a continuous variable. Can one extract 
the parameters of such FP-based models from the data 
collected from single-molecule experiments? 

One of the key ingredients of the FP-based approach is 
the profile of the potential V^ (x) felt by the motor in the 
"chemical" state /i. To my knowledge, it has not been 
possible to extract it from any experimental data. Now, 
let us define 



d(j){x)/dx 
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E^^m(^) 



5]P^(x)[dV^^(x)/& 



(67) 



to be the profile of the potential averaged over all the 



Energy is a scalar quantity whereas velocity is a vector. 
How does consumption of energy give rise to a non-zero 
average velocity of a molecular motor? Moreover, a di- 
rected movement that a motor exhibits on the average, 
requires breaking the forward-backward symmetry on its 
track. What are the possible cause and effects of this 
broken symmetry at the molecular level? 

As far as the cause of this asymmetry is concerned, the 
asymmetry of the tracks alone cannot explain the "di- 
rected" movement of the motors, because on the same 
track members of different families of motors can, on 
the average, move in opposite directions. Obviously, the 
structural design of the motors and their interactions 
with the respective tracks also play crucial roles in deter- 
mining their direction of motion along a track. Further- 
more, this broken symmetry at the molecular level, e.g., 
the "directed" movement of molecular motors, has im- 
portant effects on various biological phenomena, partic- 
ularly "vectorial" processes, at the sub-cellular and cel- 
lular levels. In general, a cell is not isotropic. Motors are 
essential in breaking the cellular symmetry |194| . Can 
we establish a unique s et o f bas ic principle underlying 
the symmetry breaking J194ill95| ? 
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Therefore, the cause and effects of broken symmetry 
of molecular motors can be examined in the broader con- 
text of the fundamental princ iples of symmetry breaking 
in physics and biology [l96| (see also other articles in 
the s pecia l "Perspectives on Symmetry Breaking in Biol- 
ogy" [197] )■ For macroscopic systems in thermodynamic 
equilibrium, symmetry breaking is explained in terms of 
the form of the free energy. However, since living cells 
are far from thermodynamic equilibrium, the theory of 
symmetry breaking in those systems cannot be based on 
thermodynamic free energy. Kinetics cannot be ignored 
in the study of symmetry breaking in living systems. 



B. Self-organization and pattern formation: 
assembling machines and cellular morphogenesis 

The interior of a living bacterial cell is far from ho- 
mogeneous; the intracellular space of eukaryotes are di- 
vided into separate compartments. Scaling is one of the 
interesting properties of many physical quantities that 
gives rise to some well known universalities. The scal- 
ing prope rties of a cell and the subcellular compartments 
[IQSl ll99|. often depend on the machineries which as- 
semble them. 

The size, shape, location and number of intracellu- 
lar compartments as well as modular intracellular ma- 
chineries are self-organized, rather than self-assembled. 
Dissipation takes place in "self-organization" and distin- 
guishes it from "self-assembly" ; the latter corresponds to 
the minimum of thermodynamic free energy whereas self- 
organize d systeni does not attain thermodynamic equi- 
librium pM[200l - |202] . Molecular motors and their fila- 
mentous tracks play imp ortant ro les in the intracellular 
self-organization process [l95l |200J | and even in the cellu- 
lar morphogenesis which may be regarded as a problem 
of pattern formation far from equilibrium. 



C. Dissipationless computation: polymerases as 
"tape-copying Turing machines" 

The concept of information in the context of biolog- 
ical systems has been discussed at length in the past 
(203J |. The operation of molecular machines involved in 
genetic processes can be analyzed in terms of storage and 
transmission of information. In fact, a particular class of 
machines carry out what may be viewed as data trans- 
mission whereas that of ot hers may be viewed as digital- 
to- analog conversion [204l | . For these obvious reasons, 
a broad class of molecular machines are interesting also 
from the perspective of information theory, electronic en- 
gineering and computer science. 

Computation can be viewed as a transition from one 
state to another. However, in a conventional digital com- 
puter an elementary operation is logically irreversible. To 
understand the meaning of this term, consider the two 
summations 3 -f 1 = 4 and 2 -f 2 = 4. If the computer 



retains only the output, i.e., 4 and erases the input num- 
bers (i.e., 3 and 1, or 2 and 2, as the case may be) after 
summing the two input numbers, then, given only the 
output (i.e., 4) it is impssible to figure out whether the 
input were 3,1 or 2,2. Note that erasing every bit leads 
to loss of information which may also be interpreted as 
an increase of entropy. However, strategies for logically 
reversible computation have been developed [205, 206]. 
For example, the simplest strategy for logically reversible 
computation is based on the prescription that neither the 
initial input nor the data in any intermediate step should 
be erased; instead, these should be retained in an auxil- 
iary register. 

In practice, computation is carried out with a device 
that is governed strictly by the laws of physics that in- 
cludes thermodynamics. Since entropy increases in ev- 
ery irreversible physical process, it would be tempting to 
associate the logical irreversibility of computation with a 
physical irreversibility of the computational device. Since 
each bit of a classical digital computer has only two pos- 
sible states, at first sight, one would e xpec t dissipation of 
ksT ln2 energy for every bit erased [207|. But, the pos- 
sibility of logically reversible computation raises a funda- 
mental question: is it possible for a physical computing 
device to carry out physically reversible (and, hence non- 
dissipative) computation? 

Operation of a polymerase (and that of a ribosome) 
can be regarded as computation. More precisely, such 
a computing machine can be viewed as a "tape-copying 
Turing machine" that polymerizes its output tape , in- 
stead of merely writing on a pre-synthesized tape |205j | 
Dissipationless operation of these machines is possible 
only if every step is error-free which, in turn, is achiev- 
able in the vanishingly snail speed, i.e., reversible limit 



D. Enzymatic processes: conformational 
fluctuations, static and dynamic disorder 

For a motor that doesn't step backwards, the position 
advances in the forward direction one step at a time; how- 
ever, the time gap between the successive steps, i.e., dwell 
time, is a random variable. Similarly, in single-molecule 
enzymology, the population of the product molecules in- 
creases by one in each enzymatic cycle, the time gap be- 
tween the release of the successive pr oduct mo lecules is 
random. The distribution of this time |209ll210| is analo- 
gous to that of the dwell times of molecular motors |l20j | . 
Therefore, the research fields of single-motor mechanics 
and single-enzyme reactions can enrich each other by ex- 
change of concepts and techniques Il48| . 

As a concrete example, consider the chemical reaction 



E + S^^h^E- 



P 



(69) 



catalyzed by the enzyme E where S and P denote the 
substrate and product, respectively. The enzyme and 
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the substrate form, at a rate wi, an intermediate com- 
plex /i that can either dissociate into the free enzyme 
and the substrate at a rate uj^i, or get converted irre- 
versibly into the product and free enzyme at a rate W2- 
In a bulk sample, where a large number of enzymes con- 
vert many substrates into products by catalyzing this re- 
action simultaneously, the measured rate of the reaction 
is actually an average over the ensemble. This rate was 
calculated by Michaelis and Menten about 100 years ago 
and is given by the celebrated Michaelis-Menten equation 

Formally, the scheme (I69[) is very similar to the scheme 
(|9]) that we presented earlier as a very simple example of 
the mechano-chemical cycle of a molecular motor. The 
time taken by the chemical reaction ((69)) fluctuates from 
one enzymatic cycle to another. One of the fundamen- 
tal questions is: what are the conditions under which 
the average rate would still satisfy the Michaelis-Menten 
equation [209 , 210] ? 

Another topic that overlaps research on molecula r mo - 
tor kinetics and chemical kinetics is allosterism [212[. 
A motor protein has separate sites for binding the fuel 
molecule and the track. How do these two sites commu- 
nicate? How does the binding of ligand at one site affect 
the binding affinity of the other? The mechanochemical 
cycle of a motor can be analyzed [213| from the perspec- 
tive of allosterism which is one of the key mechanisms of 
cooperativity in protein kinetics J214l |. 



E. Applications in biomimetics and 
nano-technology 

Initially, technology was synonymous with macro- 
technology. The first tools applied by primitive humans 
were, perhaps, wooden sticks and stone blades. Later, as 
early civilizations started using levers, pulleys and wheels 
for erecting enormous structures like pyramids. Until 
nineteenth century, watch makers were, perhaps, the only 
people working with small machines. Using magniiying 
glasses, they worked with machines as small as 0.1mm. 
Micro-technology, dealing with machines at the length 
scale of micrometers, was driven, in the second half of 
the twentieth century, largely by the computer miniatur- 
ization. 

In 1959, Richard Feynman delivered a talk J215l | at a 
meeting of the American Physical Society. In this talk, 
entitled " There 's Plenty of Room at the Bottom" , Feyn- 
man drew attention of the scientific community to the 
unlimited possibilities of manupilating and controlling 
things on the scale of nano-meters. This famous talk is 
now regarded by the majority of physicists as the defin- 
ing moment of nano-technology i216]. In the same talk, 
in his characteristic style, Feynman noted that "many 
of the cells are very tiny, but they are very active, they 
manufacture various substances, they walk around, they 
wiggle, and they do all kinds of wonderful things- all on 
a very small scale" . 



From the perspective of applied research, the natural 
molecular machines opened up a new frontier of nano- 
technology 217l - l222j |. The miniaturization of compo- 
nents for the fabrication of useful devices, which are es- 
sential for modern technology, is currently being pursued 
by engineers following mostly a top-down (from larger 
to smaller) approach. On the other hand, an alternative 
approach, pursued mostly by chemists, is a bottom-up 
(from smaller to larger) approach. 

Unlike man-made machines these are products of Na- 
ture's evolutionary design over billions of years. In fact, 
cell h as b een compared to an "archeological excavation 
site" [223, the oldest modules of functional devices are 
the analogs of the most ancient layer of the exposed site 
of excavation. We can benefit from Nature's billion year 
experience in nano-technolgy. The te rm biomi metics has 
already become a popular buzzword |220l l22lj | ; this field 
deals with the design of artificial systems utilizing the 
principles of natural biological systems. 



X. CONCEPT OF BIOLOGICAL MACHINES: 
FROM ARISTOTLE TO ALBERTS 

The concept of "living machine" evolved over many 
centuries. Some of the greatest thinkers in human history 
made significant contributions to this concept. It started 
with the man-machine (ancL more generally, animal- 
machine) analogy. Aristotle J224j distinguished between 
the "body" and the "soul" of an organism. However, 
after more than one and half millenia, an intellectu- 
ally provocative idea was put forward by Rene Descartes 
when he argued that animals were "living machines" . 
This idea took its extreme form in Julien OfFray de 
La Mettrie's book |225| L'homme Machine ('man a ma- 
chine'). Leibnitz |226l | wrote that the body of a living 
being is a kind of "divine machine or natural automa- 
ton" and it "surpasses all artificial automata" , because 
not each part of a man-made machine is itself a machine. 
In contrast, he argued, living bodies are "machines in 
their smallest parts ad infinitum" . Is this statement to 
be interpreted as Leibnitz's speculation for the existence 
of machines within machines in a living organism? The 
debate over this interpretation continues [227l |228J | . 

All the great thinkers from Aristotle to Descartes and 
Leibnitz compared the whole organism with a machine, 
the organs being the coordinated parts of that machine. 
Cell was unknown; even micro-organisms became visible 
only after the invention of the optical microscope in the 
seventeenth century. Marcelo Malpighi, father of micro- 
scopic anatomy, speculated in the 17th century about the 
existence molecular machines in living systems. He wrote 
(as quoted in english by Marco Piccolino [229]) that the 
organized bodies of animals and plants been constructed 
with " very large number of machines" . He went on to 
characterize these as "extremely minute parts so shaped 
and situated, such as to form a marvelous organ" . Un- 
fortunately, the molecular machines were invisible not 
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only to the naked eye, but even under the optical micro- 
scopes available in his time. In fact, individual molec- 
ular machines could be "caught in the act" only in the 
last quarter of the 20th century. We highlight here the 
progress made during the last three centuries when grad- 
ually the analogy with machine got extended to cover all 
levels of biological organization- from the topmost level 
of organisms down to cellular and subcellular levels. Not 
surprisingly, muscles seem to have attracted maximum 
attention in the context of machinery of life. 

Thomas Henry Huxley, dubbed as "Darwin's bulldog" 
for his strong support for Darwinian ideas of evolution, 
delivered his famous lecture, titled "On the Physical Ba- 
sis of Life", on 8th November, 1868 (see ref.Q for the 
published version). Among the provocating statements 
and insightful comments in this article, which received 
lot of hostile criticism at that time, I quote only a few 
that are directly relevant from the perspective of molec- 
ular machines. Huxley had the foresight to see that [9;] 
"speech, gesture, and every other form of human action 
are, in the long run, resolvable into mascular contrac- 
tion, and muscular contraction is but a transitory change 
in the relative positions of the parts of a muscle"- a re- 
markably insightful comment on contractility and motil- 
ity because the mechanism of muscle contraction was dis- 
covered almost 90 years later, one of the discoverers being 
his grandson! 

David Ferrier, a pioneering neurologist and a you nger 
contemporary of Thomas Huxley, in his lecture [230j de- 
livered at the Middlesex Hospital Medical School, on 
October 5th, 1870, not only echoed similar ideas but 
stressed the role of physical energy, rather than any hy- 
pothetical vital action, for sustaining life of an organism. 
A few years later, Robert Henry Thurston, the first pres- 
ident of the ASME (American Society of Mechanical En- 
gineers) and a pioneer of modern engineering education, 
investiga ted w hat he called a "vital machine" (or "prime 
motor") [23 1| from an engineer's perspective. 

In the initial stages, most of the visionaries compared 
animals with a machine. Although movements of plants 
received much less attention, results of pioneering sys- 
tematic studies of these phenomena were reported al- 
ready in the n inete enth century by Charles Darwin in 
a classic book J232l | which was co-authored by his son. 
Later, in the preface of his report on the classic investi- 
gations on the mechani cal r esponse of plants to stimuli, 
Jagadis Chandra Bose |233J wrote: "From the point of 
view of its movements a plant may be regarded ...simply 
as a machine, transforming the energy supplied to it, in 
ways more or less capable of mechanical explanation" . 
Interestin gly, t he first chapter was titled "The plant as a 
machine" 
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phenomenon that is now known as cytoplasmic streaming 
0. His speculation that "the cause of these currents 
lie in contraction" was established a century later when 
cytoplasmic streaming was shown to be caused by an 
acto-myosin system. 

"The story of the living machine" J235{ narrated by 
Herbert William Conn, one of the founding members and 
the third president of the Society of Americam Bacteriol- 
ogists (renamed, in 1960, American Society for Microbi- 
ology) , is a critical overview of the understanding of these 
machines at the end of the nineteenth century. Because 
of his deep understanding of the basic principles of phys- 
ical sciences as well as evolutionary biology, his narrative 
on the fundamental principles of living machines remains 
as contemporary today as it was at the time of its publi- 
cation. Conn asked whether the operations of individual 
organisms, i.e., the living machines, co uld b e "reduced 
to the action of still smaller machines" [2355. Conn ar- 
gued that one can look upon each constituent cell of an 
organis m a lso "a little en gine with admirably adapted 
parts" I235J . His summary [235j | that a living organism is 
a "series of machines one within the other" sounds very 
similar to Leibnitz's philosophical idea. Conn went even 
further: "As a whole it is machine, and its parts are sep- 
arate machines. Each part is further made up of still 
smaller machines until we reach the realm of the micro- 
scope. Here still we find the same story. Even the parts 
formerl y ca lled units, prove to be machines". He spec- 
ulated j235j "we may find still further machines within" 
cells. He ended his summary with the statement "And 
thus vital activity is reduced to a complex of machines, 
all acting in harmony with each other to produce together 
the one result- life" ;235 j. 

Conn [235J ] not only discussed "the running of the 
living machine" , but also "the origin of the living ma- 
chine" . In the latter context, while pointing out the dif- 
ferences in the principles of engineering design of man- 
made machines and evolutionary principles of nature's 
nano- machines, he wrote |235j : "It is something as if 



Based on the well known fact that all animals exhibit 
irritability and contractility [2341 , Thomas Henry Huxley 
had already speculated in ref.[9|, that "it is more than 
probable, that when the vegetable world is thoroughly 
explored, we shall find all plants in possession of the same 
powers..". In support of this possibility he described a 



steam engine of Watt should be slowly changed by adding 
piece after piece until there was finally produced the mod- 
ern quadruple expansion engine, but all this change being 
made upon the original engine without once stopping its 
motion." 

Jaques Loeb, a famous embryologist, delivered a se- 
ries of eight lectures at the Columbia university in 1902 
(see ref.[236] for a more complete published version). In 
the first lecture, Loeb started by saying "In these lec- 
tures we shall consider living organisms as chemical ma- 
chines, consisting essentially of colloidal material, which 
possess the peculiarities of automaticall y de veloping, pre- 
serving, and reproducing themselves" [236J. He empha- 
sized the crucial differences between natural and artificial 
machines by the following statement: "The fact that the 
machines which can be created by man do not possess 
the power of automatic development, self-preservation, 
and reproduction constitutes for the pesent a fundamen- 
tal difference between living machines and artificial ma- 
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chines" . However, just like some of his other visionary 
predecessors, he also admitted that "nothing contradicts 
the possibility that the artificial pro duct ion of living mat- 
ter may one day be accomplished" J236l |. 

The concept of "living machine" as a "transformer of 
energy" , from the level of a single cell to the level of a 
multi-cellular organism as complex as a man, found men- 
tion in many lectures and books of the leading biologists 
in the late nineteenth and early twentieh centuries (see, 
for example, [2371] ) . Research on molecular machines was 
focussed almost exclusively on the mechanism of muscle 
contraction during the first half of the 20th century and 
it wa s dominated by Archibald Hill and Otto Meyerhof 
[238l |239] who shared the Nobel prize in Physiology (or 
medicine) in 1922. 

By mid-twentieth century, physical sciences already 
witnessed spectacular progress and life sciences was just 
embarking on its golden period. In his Guthrie lecture 
(also titled "The Physical Ba sis o f Life"), delivered on 
21st November, 1947 (see ref.jli^ for the fuU text), J. 
D. Bernal drew attention of the audience to the structure 
and dynamics of machines. Ten years later, in a review 
article, the title of which aga in had the words "physi- 
cal basis of life" , Schmitt [24ll | made even more concrete 
references to molecular machines covering almost all the 
types of machines that we have sketched in this article. 

The concept of molecular machines J242j | was men- 
tioned on many occasions in the late twentieth century by 
leading molecular biologists who made outstanding con- 
tributions in elucitaing their mechanisms of operation. 
The strongest impact was made, however, by the infiu- 
ential paper of Bruce Alberts |243J | , then the president of 
the National Academy of Sciencs (USA). He wrote that 
"i/ie entire cell can he viewed as a factory that contains 
an elaborate network of interlocking assembly lines, each 
of w hich is composed of a set of large protein machines" 
[243J . 

Why did I start my s tory with Aristotle? In D'Arcy 
Thompson's words [244| "We know that the history of 
biology harks back to Aristotle by a road that is straight 
and clear, but that beyond him the road is broken and 
the lights are dim". Moreover, a section of the mod- 
ern enterprise in animal sciences is pursuing important 
investigations on the efficiency of energy utilization by 
animals |245{ treating an entire animal as an a "combus- 
tion engine" , a modern and scientifically correct version 
of the original philosophical idea developed by Aristotle 
and propagated by his followers. 

Why am I closing my story with Alberts? Because of 
a fortutious coincidence, Alberts' vision for training the 
next generation of biologists for studying natural nano- 
machines coincided with the explosive beginning of nano- 
technology that includes researc h pr ograms on artificial 
nano-machines. Alberts' article |243l | was appeared at a 
time when the statistical physics of Brownian ratchets 
was getting lot of attention 94] . The concept of molec- 
ular machine has matured fully into an area of inter- 
disciplinary research in the twenty-first century. A new 



exciting era of research has just begun! 



XI. SUMMARY AND OUTLOOK 

So far as the current status of our understanding is 
concerned, I would say that till the middle of the 20th 
century we had never seen or manipulated a single molec- 
ular machine although machine-like operation of a cell or 
an entire organism was fairly well established. In the last 
60 years this area has seen explosive growth in activity. 
The structures as well as the mechanisms of many ma- 
chines no longer appear any more mysterious than those 
of macroscopic machines. 

It is practically impossible to predict new ideas in any 
field of research; molecular machines are no exception. 
The reason, as Peter Medawar [246] admitted in his pres- 
idential address at the Cambridge meeting of the British 
Association for the Advancement of Science, is as follows: 
"to predict an idea is to have an idea, and if we have 
an idea it can no longer be the subject of prediction" . 
Therefore, in this section, I do not propose any new idea 
but merely mention a few systems and phenomena which 
need new ideas for their studies and understanding. 

Let me begin with solving the forward problem with 
process modeling. Here we need new ideas to make 
progress in two opposite directions: (a) in-silico modeling 
of single individual machines in terms of its coordinat- 
ing parts in an aqueous medium- aquatic nano-robotics; 
(b) integration of nano-machines and machine-assemblies 
into a micro-factory, the living cell. So far as the point 
(a) is concerned, serious efforts have been ma de by de- 
veloping coarse-grained computational models 247| - |249J 1 
that can be regarded as the substitutes for full molecular 
dynamics based models. However, a technical (or algo- 
rithmic) breakthrough is needed to achieve the ultimate 
goal of "seeing" the operation of a machine in its natu- 
ral aqueous environment by carrying out an experiments 
in-silico. 

Next let me explain the aim of developing integrated 
models. It has been strongly argued at the dawn of this 
millenium 250] that the machinery of life is modular. 
The machineries of transcription, translation, replication, 
chromosome segregation, etc. are all examples of mod- 
ules which perform specific tasks. The components of 
some of these modules are parts of a single machine, e.g., 
all devices participating in translation function on a sin- 
gle platform provided by a ribosome. Signal transduction 
machinery is an extreme example of the other types of 
functional modules which are spatially distributed over 
a significantly large region of intracellular space without 
need for direct physical contacts among the parts. 

The machines that we have considered here are all 
more or less spatially-confined functional modules. Nor- 
mally, functional modules are not completely isolated and 
must communicate and coordinate their function with 
other modules. For example, DNA replication and chro- 
mosome segregation machineries require proper coordi- 
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nation. To my knowledge, no attempt has been made 
so far for quantitative stochastic process modeling incor- 
porating more than one functional module in a seamless 
fashion. Such an enterprise may look like what is now 
the happening in systems biology: to integrate the op- 
erations of machineries for different functional modules 
within a single theoretical framework. 

Finally, let me point out some of the standard prac- 
tices of statistical inference that, to my knowledge, have 
not been followed so far while reverse-engineering molec- 
ular machines. The experimental data for molecular ma- 
chines have been analyzed by several groups to extract 
an underlying kinetic model. However, most often the 
analysis carried out during such reverse engineering is 
based on a single working hypothesis. It would be desir- 
able to follow Piatt's (25 1| principle of "strong inference" 
which is an extension of Chamberlin's |252| "method of 
multiple working hypothesis" . By multiple model I do 
not mean models hierarchically nested such that each 
one is a special case of the model at the next level. By 
the term multiple model, I mean truly competing models 
that may, however, overlap partially. The relative scores 
of the competing models (and the corresponding underly- 
ing hypothese) would be a true reflection of their merits. 
For example, in case of a closely related systems in cell bi- 
ology and systems biology, experimental data have been 
analy zed recently within the framework of this method 
[253J . In the case of molecular machines, a method for 
model selection has been developed 254] ; it extracts the 
best model by optimizing the maximum evidence, rather 
than maximum likelihood, that treats, for example, the 



number of discrete states of the system as a variational 
parameter. This is a step in the right direction. However, 
I am not aware of any work that ranks alternative models 
according to relative scores computed on the basis of the 
principle of strong inference. 

The molecular mechanism of muscle myosin and the 
structure of double-stranded DNA were both discovered 
in the 1950s. These discoveries set the stage for the re- 
search on cytoskeletal motors as well as on machines that 
make, break and manipulate nucleic acids and proteins. 
Last 50 years has seen enormous progress. I wanted to 
tell you about the Pis and their contributions during this 
glorious period. But, I have run out of space. So, I' ll nar- 
rate this fascinating story in detail elsewhere (255| . 
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